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95 Human Secreted Proteins 
Field of the Invention 

This invention relates to newly identified polynucleotides and the 
polypeptides encoded by these polynucleotides, uses of such polynucleotides and 
5 polypeptides, and their production. 

Background of the Invention 
Unlike bacterium, which exist as a single compartment surrounded by a 
membrane, human cells and other eucaryotes are subdivided by membranes into many 
functionally distinct compartments. Each membrane-bounded compartment, or 
10 organelle, contains different proteins essential for the function of the organelle. The 
cell uses "sorting signals," which are amino acid motifs located within the protein, to 
target proteins to particular cellular organelles. 

One type of sorting signal, called a signal sequence, a signal peptide, or a 
leader sequence, directs a class of proteins to an organelte called the endoplasmic 
15 reticulum (ER). The ER separates the membrane-bounded proteins from all other 
types of proteins. Once localized to the ER, both groups of proteins can be further 
directed to another organelle called the Golgi apparatus. Here, the Golgi distributes 
the proteins to vesicles, including secretory vesicles, the cell membrane, lysosomes, 
and the other organelles. 
20 Proteins targeted to the ER by a signal sequence can be released into the 

extracellular space as a secreted protein. For example, vesicles containing secreted 
proteins can fuse with the cell membrane and release their contents into the 
extracellular space - a process called exocytosis. Exocytosis can occur constitutively 
or after receipt of a triggering signal. In the latter case, the proteins are stored in 
25 secretory vesicles (or secretory granules) until exocytosis is triggered. Similarly, 
proteins residing on the cell membrane can also be secreted into the extracellular 

space~by^rotedlyti"c^leavage r of a 4 1iril^r M holding the proteinTto the membrane: 

Despite the great progress made in recent years, only a small number of genes 
encoding human secreted proteins have been identified. These secreted proteins 
30 include the commercially valuable human insulin, interferon, Factor VIII, human 
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growth hormone, tissue plasminogen activator, and erythropoeitin. Thus, in light of 
the pervasive role of secreted proteins in human physiology, a need exists for 
identifying and characterizing novel human secreted proteins and the genes that 
encode them. This knowledge will allow one to detect, to treat, and to prevent 
medical disorders by using secreted proteins or the genes that encode them. 

Summary of the Invention 

The present invention relates to novel polynucleotides and the encoded 
polypeptides. Moreover, the present invention relates to vectors, host cells, 
antibodies, and recombinant methods for producing the polypeptides and 
polynucleotides. Also provided are diagnostic methods for detecting disorders related 
to the polypeptides, and therapeutic methods for treating such disorders. The 
invention further relates to screening methods for identifying binding partners of the 
polypeptides. 

Detailed Description 

Definitions 

The following definitions are provided to facilitate understanding of certain 
terms used throughout this specification. 

In the present invention, "isolated" refers to material removed from its original 
environment (e.g., the natural environment if it is naturally occurring), and thus is 
altered "by the hand of man" from its natural state. For example, an isolated 
polynucleotide could be part of a vector or a composition of matter, or could be 
contained within a cell, and still be "isolated" because that vector, composition of 
matter, or particular cell is not the original environment of the polynucleotide. 

In the present invention, a "secreted" protein refers to those proteins capable 
of being directed to the ER, secretory vesicles, or the extracellular space as a result of 
~asignal s^uer^^, aswell as those proteins released into the extracellular space 
without necessarily containing a signal sequence. If the secreted protein is released 
into the extracellular space, the secreted protein can undergo extracellular processing 
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to produce a "mature" protein. Release into the extracellular space can occur by many 
mechanisms, including exocytosis and proteolytic cleavage. 

In specific embodiments, the polynucleotides of the invention are less than 
300 kb, 200 kb, 100 kb, 50 kb, 15 kb, 10 kb, or 7.5 kb in length. In a further 
5 embodiment, polynucleotides of the invention comprise at least 15 contiguous 
nucleotides of the coding sequence, but do not comprise all or a portion of any intron. 
In another embodiment, the nucleic acid comprising the coding sequence does not 
contain coding sequences of a genomic flanking gene (i.e., 5* or 3' to the gene in the 
genome). 

10 As used herein , a "polynucleotide" refers to a molecule having a nucleic acid 

sequence contained in SEQ ID NO:X or the cDNA contained within the clone 
deposited with the ATCC. For example, the polynucleotide can contain the 
nucleotide sequence of the full length cDNA sequence, including the 5' and 3* 
untranslated sequences, the coding region, with or without the signal sequence, the 

15 secreted protein coding region, as.weH~as fragments, epitopes, domains, and variants 
of the nucleic acid sequence. Moreover, as used herein, a "polypeptide" refers to a 
molecule having the translated amino acid sequence generated from the 
polynucleotide as broadly defined. 

In the present invention, the full length sequence identified as SEQ ID NO:X 

20 was often generated by overlapping sequences contained in multiple clones (contig 
analysis). A representative clone containing all or most of the sequence for SEQ ID 
NO:X was deposited with the American Type Culture Collection ("ATCC"). As 
shown in Table 1 , each clone is identified by a cDN A Clone ID (Identifier) and the 
ATCC Deposit Number. The ATCC is located at 10801 University Boulevard, 

25 Manassas, Virginia 201 10-2209, USA. The ATCC deposit was made pursuant to the 
terms of the Budapest Treaty on the international recognition of the deposit of 
microorganisms for purposes of patent procedure. 

A-"polynucleotide"-of thepresentinvention alsoincludes those 

polynucleotides capable of hybridizing, under stringent hybridization conditions, to 

30 sequences contained in SEQ ID NO:X, the complement thereof, or the cDNA within 
the clone deposited with the ATCC. "Stringent hybridization conditions" refers to an 
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overnight incubation at 42° C in a solution comprising 50% formamide, 5x SSC (750 
mM NaCI, 75 mM sodium citrate), 50 mM sodium phosphate (pH 7.6), 5x Denhardt's 
solution, 10% dextran sulfate, and 20 M-g/ml denatured, sheared salmon sperm DNA, 
followed by washing the filters in O.lx SSC at about 65°C 

Also contemplated are nucleic acid molecules that hybridize to the 
polynucleotides of the present invention at lower stringency hybridization conditions. 
Changes in the stringency of hybridization and signal detection are primarily 
accomplished through the manipulation of formamide concentration (lower 
percentages of formamide result in lowered stringency); salt conditions, or 
temperature. For example, lower stringency conditions include an overnight 
incubation at 37°C in a solution comprising 6X SSPE (20X SSPE = 3M NaCI; 0.2M 
NaH 2 P0 4 ; 0.02M EDTA, pH 7.4), 0.5% SDS, 30% formamide, 100 ug/ml salmon 
sperm blocking DNA; followed by washes at 50°C with 1XSSPE, 0.1% SDS. In 
addition, to achieve even lower stringency, washes performed following stringent 
hybridization can be "done at higher salt concentrations (e.g. 5X SSC). 

Note that variations in the above conditions may be accomplished through the 
inclusion and/or substitution of alternate blocking reagents used to suppress 
background in hybridization experiments. Typical blocking reagents include 
Denhardt's reagent, BLOTTO, heparin, denatured salmon sperm DNA, and 
commercially available proprietary formulations. The inclusion of specific blocking 
reagents may require modification of the hybridization conditions described above, 
due to problems with compatibility. 

Of course, a polynucleotide which hybridizes only to polyA+ sequences (such 
as any 3' terminal polyA+ tract of a cDNA shown in the sequence listing), or to a 
complementary stretch of T (or U) residues, would not be included in the definition of 
"polynucleotide," since such a polynucleotide would hybridize to any nucleic acid 
molecule containing a poly (A) stretch or the complement thereof (e.g., practically 
any double-stranded cDNAdone). — ~~ 

The polynucleotide of the present invention can be composed of any 
polyribonucleotide or polydeoxribonucleotide, which may be unmodified RNA or 
DNA or modified RNA or DNA. For example, polynucleotides can be composed of 
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single- and double-stranded DNA, DNA that is a mixture of single- and double- 
stranded regions, single- and double-stranded RNA, and RNA that is mixture of 
single- and double-stranded regions, hybrid molecules comprising DNA and RNA 
that may be single-stranded or, more typically, double-stranded or a mixture of single- 
and double-stranded regions. In addition, the polynucleotide can be composed of 
triple-stranded regions comprising RNA or DNA or both RNA and DNA. A 
polynucleotide may also contain one or more modified bases or DNA or RNA 
backbones modified for stability or for other reasons. "Modified" bases include, for 
example, tritylated bases and unusual bases such as inosine. A variety of 
modifications can be made to DNA and RNA; thus, "polynucleotide" embraces 
chemically, enzymatically, or metabolically modified forms. 

The polypeptide of the present invention can be composed of amino acids 
joined to each other by peptide bonds or modified peptide bonds, i.e., peptide 
isosteres, and may contain amino acids other than the 20 gene-encoded amino acids. 
The polypeptides may be modified by either natural processes, such as 
posttranslational processing, or by chemical modification techniques which are well 
known in the art. Such modifications are well described in basic texts and in more 
detailed monographs, as well as in a voluminous research literature. Modifications 
can occur anywhere in a polypeptide, including the peptide backbone, the amino acid 
side-chains and the amino or carboxyl termini. It will be appreciated that the same 
type of modification may be present in the same or varying degrees at several sites in 
a given polypeptide. Also, a given polypeptide may contain many types of 
modifications. Polypeptides may be branched , for example, as a result of 
ubiquitination, and they may be cyclic, with or without branching. Cyclic, branched, 
and branched cyclic polypeptides may result from posttranslation natural processes or 
may be made by synthetic methods. Modifications include acetylation, acylation, 
ADP-ribosylation, amidation, covalent attachment of flavin, covalent attachment of a 

heme-moiety^-covalent-attachment of a nucleotide or nucleotide derivative; covalent 

attachment of a lipid or lipid derivative, covalent attachment of phosphotidylinositol, 
cross-linking, cyclization, disulfide bond formation, demethylation, formation of 
covalent cross-links, formation of cysteine, formation of pyroglutamate, formylation, 
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gamma-carboxylation, glycosylation, GPI anchor formation, hydroxylation, 
iodination, methylation, myristoylation, oxidation, pegylation, proteolytic processing, 
phosphorylation, prenylation, racemization, selenoylation, sulfation, transfer-RNA 
mediated addition of amino acids to proteins such as arginylation, and ubiquitination. 
(See, for instance, PROTEINS - STRUCTURE AND MOLECULAR PROPERTIES, 
2nd Ed., T. E. Creighton, W. H. Freeman and Company, New York (1993); 
POSTTRANSLATIONAL COVALENT MODIFICATION OF PROTEINS, B. C. 
Johnson, Ed., Academic Press, New York, pgs. 1-12 (1983); Seifter et ah, Meth 
Enzymol 182:626-646 (1990); Rattan et al., Ann NY Acad Sci 663:48-62 (1992).) 

"SEQ ID NO:X M refers to a polynucleotide sequence while "SEQ ID NO: Y M 
refers to a polypeptide sequence, both sequences identified by an integer specified in 
Table L 

"A polypeptide having biological activity" refers to polypeptides exhibiting 
activity similar, but not necessarily identical to, an activity of a polypeptide of the 
present invention, including mature forms, as measured in a particular biological 
assay, with or without dose dependency. In the case where dose dependency does 
exist, it need not be identical to that of the polypeptide, but rather substantially similar 
to the dose-dependence in a given activity as compared to the polypeptide of the 
present invention (Le., the candidate polypeptide will exhibit greater activity or not 
more than about 25-fold less and, preferably, not more than about tenfold less 
activity, and most preferably, not more than about three-fold less activity relative to 
the polypeptide of the present invention.) 

Polynucleotides and Polypeptides of the Invention 

FEATURES OF PROTEIN ENCODED BY GENE NO: 1 

This gene is expressed primarily in anergic T cells and merkel cells. 

Therefore, polynucleotides and polypeptides of the ihventiblTafe~useful~as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, immune disorders and inflammatory diseases. Similarly, polypeptides 
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and antibodies directed to these polypeptides are useful in providing immunological 
probes for differential identification of the tissue(s) or cell type(s). For a number of 
disorders of the above tissues or cells, particularly of the immune system, expression 
of this gene at significantly higher or lower levels may be routinely detected in certain 
5 tissues or cell types (e.g., immune, cancerous and wounded tissues) or bodily fluids 
(e.g., lymph, serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from 
an individual not having the disorder. 
10 Preferred epitopes include those comprising a sequence shown in SEQ ID NO: 

108 as residues: Ala-55 to Gln-64. 

The tissue distribution in T-cells and merkel cells indicates that the protein 
products of this gene are useful for the diagnosis and/or treatment of immune system 
diseases. Furthermore, 

1 5 Expressionof this gene product in T-cells indicates a role in the regulation of 

the proliferation; survival; differentiation; and/or activation of potentially all 
hematopoietic cell lineages, including blood stem cells. This gene product may be 
involved in the regulation of cytokine production, antigen presentation, or other 
processes that may also suggest a usefulness in the treatment of cancer (e.g. by 

20 boosting immune responses). 

Since the gene is expressed in cells of lymphoid origin, the gene or protein, as 
well as, antibodies directed against the protein may show utility as a tumor marker 
and/or immunotherapy -targets for the above listed tissues. Therefore it may be also 
used as an agent for immunological disorders including arthritis, asthma, immune 

25 deficiency diseases such as AIDS, leukemia, rheumatoid arthritis, inflammatory 
bowel disease, sepsis, acne, and psoriasis. In addition, this gene product may have 
commercial utility in the expansion of stem cells and committed progenitors of 

-various blood lineagesy and in-the differentiation and/or proliferation of various cell 

types. Protein, as well as, antibodies directed against. the protein may show utility as a 

30 tumor marker and/or immunotherapy targets for the above listed tissues. 
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Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 1 1 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
5 excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 2329 of SEQ ID NO: 1 1, b is an 
integer of 15 to 2343, where both a and b correspond to the positions of nucleotide 
10 residues shown in SEQ ID NO: 1 1 , and where b is greater than or equal to a + 14. . 

FEATURES OF PROTEIN ENCODED BY GENE NO: 2 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: IPENRRPASXCTWSMWTSRTTTRRPPWGRFSSVSSASV 
1 5 SSTRKTWRTRSTSCCRSSRRRV A APFCTPS ASTEPS ARMEPPLELP V VHTFSFL 
TFVFTYRCS AGDGSITQINCAYEMGEEMPKRQMKAIKFLLFHFYL (SEQ ID 
NO:205), IPENRRPASXCTWSMWTSRTTTRRPPWGRFSSVSSASVSST (SEQ ID 
NO:206), RKTWRTRSTSCCRSSRRRVAAPFCTPSASTEPSARMEPPLELP (SEQ 
ID NO:207), and/or VVHTFSFLTFVFTYRCSAGDGSITQINCAYEMGEEMPKRQ 
20 MKAIKFLLFHFYL (SEQ ID NO:208). Polynucleotides encoding these polypeptides 
are also encompassed by the invention. 

This gene is expressed primarily in placental, brain and breast tissues, and to a 
lesser extent in T cells and tumors. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
25 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, neurodegenerative and/or endocrine disorders and neoplasias, or 

developmental-disorders. Similarly , -polypeptides and-antibodies directed to these 

polypeptides are useful in providing immunological probes for differential 
30 identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the neurodegenerative, developing, endocrine and 
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immune systems, expression of this gene at significantly higher or lower levels may 
be routinely detected in certain tissues or cell types (e.g., brain, endocrine, immune, 
developing, cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, 
plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample taken 
from an individual having such a disorder, relative to the standard gene expression 
level, i.e., the expression level in healthy tissue or bodily fluid from an individual not 
having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO: 
109 as residues: Ala-55 to Asn-60, Lys-65 to Met-71, Leu-75 to Asn-86, Asp-93 to 
Asp-1 10, Leu-130 to Cys-138, Gln-149 to Glu-154, Thr-172 to Ile-179, Glu-185 to 
Arg-192. 

The tissue distribution in breast and brain tissues indicates that the protein 
products of this gene are useful for the diagnosis and/or treatment of endocrine 
disorders, neurodegenerative disorders, developmental disorders, immune system 
diseases and neoplasias. The tissue distribution in placental tissue indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for the 
diagnosis and/or treatment of disorders of the placenta. Specific expression within the 
placenta indicates that this gene product may play a role in the proper establishment 
and maintenance of placental function. Alternately, this gene product may be 
produced by the placenta and then transported to the embryo, where it may play a 
crucial role in the development and/or survival of the developing embryo or fetus. 

Expression of this gene product in a vascular-rich tissue such as the placenta 
also indicates that this gene product may be. produced more generally in endothelial 
cells or within the circulation. In such instances, it may play more generalized roles in 
vascular function, such as in angiogenesis. It may also be produced in the vasculature 
and have effects on other cells within the circulation, such as hematopoietic cells. It 
may serve to promote the proliferation, survival, activation, and/or differentiation of 
hematopoietic-cells, as -well-as other cells-throughout the body^ Likewise, — 

Expression of this gene product in T-cells indicates a role in the regulation of 
the proliferation; survival; differentiation; and/or activation of potentially all 
hematopoietic cell lineages, including blood stem cells. This gene product may be 
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involved in the regulation of cytokine production, antigen presentation, or other 
processes that may also suggest a usefulness in the treatment of cancer (e.g. by 
boosting immune responses). 

Since the gene is expressed in cells of lymphoid origin, the gene or protein, as 
well as, antibodies directed against the protein may show utility as a tumor marker 
and/or immunotherapy targets for the above listed tissues. Therefore it may be also 
used as an agent for immunological disorders including arthritis, asthma, immune 
deficiency diseases such as AIDS, leukemia, rheumatoid arthritis, inflammatory 
bowel disease, sepsis, acne, and psoriasis. In addition, this gene product may have 
commercial utility in the expansion of stem cells and committed progenitors of 
various blood lineages, and in the differentiation and/or proliferation of various cell 
types. Alternatively, the tissue distribution in brain tissue indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for the 
detection/treatment of neurodegenerative disease states and behavioural disorders 
such as Alzheimers Disease, Parkinsons Disease, Huntingtons Disease, Tourette 
Syndrome, schizophrenia, mania, dementia, paranoia, obsessive compulsive disorder, 
panic disorder, learning disabilities, ALS, psychoses, autism, and altered behaviors, 
including disorders in feeding, sleep patterns, balance, and perception. In addition, the 
gene or gene product may also play a role in the treatment and/or detection of 
developmental disorders associated with the developing embryo, or sexually-linked 
disorders. Protein, as well as, antibodies directed against the protein may show utility 
as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 12 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersomerAccordingly; preferably excluded fronrthe present inverition^afelDn^r 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1 163 of SEQ ID NO: 12, b is an 
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integer of 15 to 1 177, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 12, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 3 
5 The translation product of this gene shares sequence homology with bovine 

beta-mannosidase, which is thought to be important in lysosomal catabolism of 
glycoproteins. See, for example, J. Biol. Chem. 270, 3841-3848 (1995), incorporated 
herein by reference in its entirety. Based on the sequence similarity between these 
proteins the translation product of this gene will sometimes hereinafter be reffered to 

10 as human beta-mannosidase. Human beta-mannosidase is expected to share certain 
biological activities, particularly enzymatic activities, with bovine beta-mannosidase. 
Such activities may be assayed by methods known in the art, described in J. Biol. 
Chem. 270, 3841-3848 (1995), and/or disclosed elsewhere herein. 

In specific embodiments, polypeptides of the invention comprise the following 

15 amino acid sequences: HPSIWSGNNENEEALMMNWYHISFTDRPIYIKDYVTL 
YVKNIRELVLAGDKSRPFITSSPTNGAETVAEAWVSQNPNSNYFGDVHFYDYI 
SDCWNWKVFPKARFASEYGYQSWPSFSTLEKVSSTEDWSFNSKFSLHRQHH 
EGGNKQMLYQAGLHFKLPQSTDPLRTFKDTIYLTQVMQAQCVKTETEFYRRS 
RSEIVDQQGHTMGALYWQLNDIWQAPSW (SEQ ID NO:209), and/or 

20 VRVHTWS 

SLEPVCSRVTERFVMKGGEAVCLYEEPVSELLRRCGNCTRESCVVSFYLSAD 

HELLSPTNYHFLSSPKEAVGLCKAQrTAIISQQGDIFWDLETSAVAPFVWLDV 

GSIPGRFSDNGFLMTEKTRTILFYPWEPTSKNELEQSFHVTSLTDIY (SEQ ID 
. NO:210). Polynucleotides encoding these polypeptides are also encompassed by the 
25 invention. The gene encoding the disclosed cDNA is thought to reside on 

chromosome 4. Accordingly, polynucleotides related to this invention are useful as a 

marker in linkage analysis for chromosome 4. 
._ _ -q^is g ene j s expressedprimarily in colon tissuerand to alesser extentin 

thymus stromal cells and chondrosarcoma tissue. 
30 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
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biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, chondroma and mannosidosis. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing immunological probes for 
differential identification of the tissue(s) or cell type(s). For a number of disorders of 
5 the above tissues or cells, particularly of the chondro and immune system. The 
expression of this gene at significantly higher or lower levels may be routinely 
detected in certain tissues or cell types (e.g., immune, metabolic, cancerous and 
wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid 
and spinal fluid) or another tissue or cell sample taken from an individual having such 

10 a disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution and homology to bovine beta-mannosidase indicates 
that the protein products of this gene are useful for the diagnosis and/or treatment of 
chondroma and mannosidosis. Human beta-mannosidosis is an autosomal recessive, 

15 lysosomal storage dfsease caused by a deficiency of the enzyme beta-mannosidase. 
Furthermore, the homology of the translation product of this gene to beta- 
mannosidase indicates that polynucleotides and polypeptides corresponding to this 
gene are useful for the diagnosis, prevention, and/or treatment of various metabolic 
disorders such as lysosomal storage deficiencies, Tay-Sachs disease, 

20 phenylkenonuria, galactosemia, hyperlipidemias, porphyrias, and Hurler's syndrome. 
Protein, as well as, antibodies directed against the protein may show utility as a tumor 
marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly- 
available and accessible through sequence databases. Some of these sequences are 

25 related to SEQ ID NO: 13 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, pfeferaBly~excl'i^^ 

more polynucleotides comprising a nucleotide sequence described by the general 
30 formula of a-b, where a is any integer between 1 to 2093 of SEQ ID NO: 13, b is an 
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integer of 15 to 2107, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 13, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 4 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: PRLTPRMKWPTAALASRLLGWTVLRPPYPRVPSLPQVT 
LHPTDGLMAVLYTGGEGRTLGEQHFFHETFVTRWLLGPVPVRFGACSPLSFL 
APRRGQGAPAGXFCACPRPASRQLCPWPALPGTPYSNSAPLCTGMGHSNTPQ 
GPPSPQY ALSPTEPTSLSGNSHLPAILVL (SEQ ID NO:211), 
PRLTPRMKWPTAAL ASRLLGWTVLRPPYPRVPSLPQVTLHP (SEQ ID 
NO:212), TDGLMAVLYTGGE GRTLGEQHFFHETFVTRWLLGPVPVRFG (SEQ 
ID NO:213), ACSPLSFLAPRRGQGAPAGXFCACPRPAS RQLCPWPALPGTP 
(SEQ ID N O : 2 1 4 ) , and/or 

YS NS APLCTGMG H S NTPQG PPS P Q Y ALS PTEPTS LS GNS HLPAILVL (SEQ ID 
NO:215). Polynucleotides encoding these polypeptides are also encompassed by the 
invention. 

This gene is expressed primarily in human lung (adult and fetal), and to a 
lesser extent in liver and brain tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, pulmonary disorders and hemostasis. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological 
probes for differential identification of the tissue(s) or cell type(s). For a number of 
disorders of the above tissues or cells, particularly of the lung and liver tissues, 
expression of this gene at significantly higher or lower levels may be routinely 
detected in certain tissues or cell types (e.g., pulmonary, cancerous and wounded 
tissues) or-bodily-fluids-(e7gr,-lymph,-sera spinaT ~ 

fluid) or another tissue or cell sample taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue or bodily fluid from an individual not having the disorder. 
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Preferred epitopes include those comprising a sequence shown in SEQ ID NO: 
1 1 1 as residues: Arg-28 to Gln-36. 

The tissue distribution in lung and liver tissues indicates that the protein 
products of this gene are useful for the diagnosis and/or treatment of pulmonary 
disorders and hematopoietic disorders. The tissue distribution in adult and fetal lung 
tissues indicates that polynucleotides and polypeptides corresponding to this gene are 
useful for the detection and treatment of disorders associated with developing lungs, 
particularly in premature infants where the lungs are the last tissues to develop. The 
tissue distribution indicates that polynucleotides and polypeptides corresponding to 
this gene are useful for the diagnosis and intervention of lung tumors, since the gene 
may be involved in the regulation of cell division, particularly since it is expressed in 
fetal tissue. Alternatively, 

Expression of this gene product in liver tissue indicates a role in the regulation 
of the proliferation; survival; differentiation; and/or activation of potentially all 
hematopoietic cell lineages, including blood stem cells. This gene product may be 
involved in the regulation of cytokine production, antigen presentation, or other 
processes that may also suggest a usefulness in the treatment of cancer (e.g. by 
boosting immune responses). 

Since the gene is expressed in cells of lymphoid origin, the gene or protein, as 
well as, antibodies directed against the protein may show utility as a tumor marker 
and/or immunotherapy targets for the above listed tissues. Therefore it may be also 
used as an agent for immunological disorders including arthritis, asthma, immune 
deficiency diseases such as AIDS, leukemia, rheumatoid arthritis, inflammatory 
bowel disease, sepsis, acne, and psoriasis. In addition, this gene product may have 
commercial utility in the expansion of stem cells and committed progenitors of 
various blood lineages, and in the differentiation and/or proliferation of various cell 
types. Protein, as well as, antibodies directed against the protein may show utility as a 
tumor marker and/or r ii^uhotherapy targets for th^aboveTistedTissues. ~ 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 14 and may have been publicly available prior to conception of 
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the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
5 formula of a-b, where a is any integer between 1 to 1248 of SEQ ID NO: 14, b is an 
integer of 15 to 1262, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 14, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 5 

10 In specific embodiments, polypeptides of the invention comprise the following 

amino acid sequences: HLLEVTPCRLPVPEFPGRTPRGSRTPD (SEQ ID NO:216). 
Polynucleotides encoding these polypeptides are also encompassed by the invention. 

This gene is expressed primarily in rapidly dividing liver tissue, (e.g., 
hepatoma, hepatocellular carcinoma, and fetal liver tissue), and to a lesser extent in 

15 normal liver tissue, and other tumors such as colon cancer-arid uterine cancer. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, cancers, particularly hepatomas, colon cancer, and uterine cancer. 

20 Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
liver, colon and uterus, expression of this gene at significantly higher or lower levels 
may be routinely detected in certain tissues or cell types (e.g., liver, colon, uterus, 

25 cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, 
synovial fluid and spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 

the expression level in healthy tissue or bodily fluid from anindividual not having" the"" 

disorder. 
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Preferred epitopes include those comprising a sequence shown in SEQ ID NO: 
1 12 as residues: Trp-35 to Trp-45, Pro-52 to Asp-57, Thr-73 to Arg-82, Pro- 105 to 
Leu-1 12, Pro-1 15 to Arg-127, Pro-140 to Gln-151. 

The tissue distribution in liver tissues and cancers thereof, as well as other 
cancerous tissues, indicates that the protein products of this gene are useful for the 
diagnosis and/or treatment of cancers, particularly, hepatoma, colon cancer and 
uterine cancer, as well as cancers of other tissues where expression has been 
observed. Furthermore, expression within cellular sources marked by proliferating 
cells indicates that this protein may play a role in the regulation of cellular division, 
and may show utility in the diagnosis and treatment of cancer and other proliferative 
disorders. Thus, this protein may also be involved in apoptosis or tissue 
differentiation and could again be useful in cancer therapy. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences", such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 15 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 745 of SEQ ID NO: 15, b is an 

integer of 15 to_759, where_both a and b correspond-to the positions of nucleotide 

residues shown in SEQ ID NO: 15, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 6 

This gene is expressed primarily in hepatocellular tumors. 

-Therefore,-polynucleotides-and-polypeptides bfthe~inventibn are usefuFas 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, hepatomas. Similarly, polypeptides and antibodies directed to these 
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polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the liver, expression of this gene at significantly higher 
or lower levels may be routinely detected in certain tissues or cell types (e.g., liver, 
5 cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, 
synovial fluid and spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

10 Preferred epitopes include those comprising a sequence shown in SEQ ID NO: 

1 13 as residues: Pro-32 to Gly-40. 

The tissue distribution in hepatocellular tumors indicates that the protein 
products of this gene are useful for the diagnosis and/or treatment of hepatomas, as 
well as cancers of other tissues where expression has been observed. Furthermore, 

15 expression within cellular sources .marked by proliferating cells indicates that this 
protein may play a role in the regulation of cellular division, and may show utility in 
the diagnosis and treatment of cancer and other proliferative disorders. Thus, this 
protein may also be involved in apoptosis or tissue differentiation and could again be 
useful in cancer therapy. Protein, as well as, antibodies directed against the protein 

20 may show utility as a tumor marker and/or immunotherapy targets for the above listed 
tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences-are - 
related to SEQ ID NO: 16 and may have been publicly available prior to conception of 

25 the present invention. Preferably, such related polynucleotides are specifically 

excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 

more polynucleotides eomprising-a nucleotide sequence described by the general 

formula of a-b, where a is any integer between 1 to 1796 of SEQ ID NO: 16, b is an 

30 integer of 15 to 1810, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 16, and where b is greater than or equal to a + 14. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 7 

This gene is expressed primarily in human rhabdomyosarcoma tissue, as well 
as in placental tissue. 

5 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, malignant neoplasms and reproductive disorders. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing . 

10 immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the skeletal system 
and reproductive system, expression of this gene at significantly higher or lower 
levels may be routinely detected in certain tissues or cell types (e.g., reproductive, 
cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, 

15 synovial fluid and spinal fluid) or another tissue or cell sample taken from an 

individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO: 
20 1 14 as residues: Arg-23 to Trp-28, Phe-93 to Lys-98, Arg-199 to Trp-206, Gly-208 to 
Met-213. 

The tissue distribution in placental tissue and human rhabdomyosarcoma 
—tissue indicates that the protein products of -this gene are useful for the diagnosis 
and/or treatment of skeletal and reproductive disorders. Furthermore, the tissue 
25 distribution in placental tissue indicates that polynucleotides and polypeptides 

corresponding to this gene are useful for the diagnosis and/or treatment of disorders 
of the placenta. Specific expression within the placenta indicates that this gene 
product-may-play a role in the proper este^ 



function. Alternately, this gene product may be produced by the placenta and then 
30 transported to the embryo, where it may play a crucial role in the development and/or 
survival of the developing embryo or fetus. . 
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Expression of this gene product in a vascular-rich tissue such as the placenta 
also indicates that this gene product may be produced more generally in endothelial 
cells or within the circulation. In such instances, it may play more generalized roles in 
vascular function, such as in angiogenesis. It may also be produced in the vasculature 
5 and have effects on other cells within the circulation, such as hematopoietic cells. It 
may serve to promote the proliferation, survival, activation, and/or differentiation of 
hematopoietic cells, as well as other cells throughout the body. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

10 Many polynucleotide sequences, such as EST sequences, are publicly 

available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 17 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 

15 cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1038 of SEQ ID NO: 17, b is an 
integer of 15 to 1052, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 17, and where b is greater than or equal to a + 14. 

20 

FEATURES OF PROTEIN ENCODED BY GENE NO: 8 

This gene is expressed primarily in fetal liver/spleen and fetal skin tissues, and 
to a lesser extent in. breast cancer tissue. . . _ 

Therefore, polynucleotides and polypeptides of the invention are useful as 
25 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, developmental disorders and neoplasias. Similarly, polypeptides and 

— antibodies directed to these polypeptides-are useful in-providing immunological 

probes for differential identification of the tissue(s) or cell type(s). For a number of 
30 disorders of the above tissues or cells, particularly of the fetal tissue and adult 

immune system, expression of this gene at significantly higher or lower levels may be 
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routinely detected in certain tissues or cell types (e.g., developing, immune, cancerous 
and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial 
fluid and spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression 
5 level in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution in fetal liver/spleen and skin tissues indicates that the 
protein products of this gene are useful for the diagnosis and/or treatment of 
developmental disorders and malignant neoplasias. Likewise, expression within fetal 
tissue and other cellular sources marked by proliferating cells indicates that this 

10 protein may play a role in the regulation of cellular division, and may show utility in 
the diagnosis and treatment of cancer and other proliferative disorders. Similarly, fetal 
development also involves decisions involving cell differentiation and/or apoptosis in 
pattern formation. Thus, this protein may also be involved in apoptosis or tissue 
differentiation and could again be useful in cancer therapy. 

1 5 Alternativelyr the tissue distribution in fetal skin tissue indicates that 

polynucleotides and polypeptides corresponding to this gene are useful for the 
treatment, diagnosis, and/or prevention of various skin disorders including congenital 
disorders (i.e. nevi, moles, freckles, Mongolian spots, hemangiomas, port-wine 
syndrome), integumentary tumors (i.e. keratoses, Bowen's disease, basal cell 

20 carcinoma, squamous cell carcinoma, malignant melanoma, Paget' s disease, mycosis 
fungoides, and Kaposi's sarcoma), injuries and inflammation of the skin (i.e.wounds, 
rashes, prickly heat disorder, psoriasis, dermatitis), atherosclerosis, uticaria, eczema, 
photosensitivity, autoimmune disorders (i.e. lupus erythematosus, vitiligo, 
dermatomyositis, morphea, scleroderma, pemphigoid, and pemphigus), keloids, striae, 

25 erythema, petechiae, purpura, and xanthelasma. Moreover, such disorders may 

predispose increased susceptibility to viral and bacterial infections of the skin (i.e. 
cold sores, warts, chickenpox, molluscum contagiosum, herpes zoster, boils, cellulitis, 

ery sipelas,-impetigo,-tinea,-althletes-f oot-, and ringworm)rProtein; as" weiras; 

antibodies directed against the protein may show utility as a tumor marker and/or 

30 immunotherapy targets for the above listed tissues. 
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Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 18 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
5 excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1 1 16 of SEQ ID NO: 18, b is an 
integer of 15 to 1 130, where both a and b correspond to the positions of nucleotide 
10 residues shown in SEQ ID NO: 18, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 9 

The translation product of this gene shares sequence homology with the 
bacterial guf A gene, as well as a C. elegans protein of unknown function. 

15 In specific embodiments,; polypeptides of the invention comprise the following 

amino acid sequences: MIPGSDSQTALNFGSTLMKKKSDPEGPALLFPESELSIRI 
GRAGLLSDKSENGEAYQRKKAAATGLPEGPAVPVPSRGNLAQPGGSSWRRI 
ALLILAITIHNVPEGLAVGVGFGAIEKTASATFESARNLAIGIGIQNFPEGLAVS 
LPLRGAGFSTWRAFWYGQLSGMVEPLAGVFGAFAVVLAEPILPYALAFAAG 

20 AMVYVVMDDIIPEAQISGNGKLASWASILGFVVMMSLDVGLG (SEQ ID 
NO:217), MIPGSDSQTALNFGSTLMKKKSDPEGPALLFPESELSIRIGRA (SEQ 
ID NO:218), GLLSDKSENGEAYQRKKAAATGLPEGPAVPVPSRGNLAQPG 
(SEQ ID N O : 2 1 9 ) , 

GSSWRRIALLILAITIHNVPEGLAVGVGFGAIEKTASATFESAR (SEQ ID 

25 NO:220), NLAIGIGIQNFPEGLAVSLPLRGAGFSTWRAFWYGQLS GMVEP 
(SEQ ID NO:221), LAGVFGAFAVVLAEPILPYALAFAAGAMVYVVM 
DDIIPEAQIS (SEQ ID NO:222), and/or GNGKLASWASILGFVVMMSLDVGLG 

(SEQ _ID_NQ:223>_ Polynucleotides -eneoding-these- polypeptides ~^are "also - 

encompassed by the invention. 

30 This gene is expressed primarily in cells of the immune system, particularly 

macrophage. 



BNSDOCin: <WO ftft»7540A1 I > 



WO 99/47540 



PCT/US99/05804 



22 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, disorders of the immune system, such as AIDS, as well as 
inflammatory disorders. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential . 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the immune system, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues or cell 
types (e.g., immune, cancerous and wounded tissues) or bodily fluids (e.g., lymph, 
serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample 
taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the diagnosis and treatment of a variety of 
immune system disorders. Expression of this gene product in immune cells such as 
macrophage indicates a role in the regulation of the proliferation; survival; 
differentiation; and/or activation of potentially all hematopoietic cell lineages, 
including blood stem cells. This gene product may be involved in the regulation of 
cytokine production, antigen presentation, or other processes that may also suggest a 
usefulness in the treatment of cancer (e.g. by boosting immune responses). 

Since the.gene Js.expressed in cells of lymphoid-origin, the-gene or protein, as - - 

well as, antibodies directed against the protein may show utility as a tumor marker 
and/or immunotherapy targets for the above listed tissues. Therefore it may be also 
used as an agent for immunological disorders including arthritis, asthma, immune 
deficiency diseases such as AIDS, leukemia, rheumatoid arthritis, inflammatory 

-bowel diseasersepsisraenerand-psoriasisr ln additionrthis-gene producrmay have 
commercial utility in the expansion of stem cells and committed progenitors of 
various blood lineages, and in the differentiation and/or proliferation of various cell 
types. Expression of this gene product in macrophage also strongly indicates a role 
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for this protein in immune function and immune surveillance. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
5 available and accessible through sequence databases. Some of these sequences are 

related to SEQ ID NO: 19 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
10 more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 869 of SEQ ID NO: 19, b is an 
integer of 15 to 883, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 19, and where b is greater than or equal to a + 14. 

15 FEATURES OF PROTEIN ENCODED BY GENE NO: 10 

This gene is expressed primarily in the spleen metastic melanoma tissue as 

well as in embryonic tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
20 biological sample and for diagnosis of diseases and conditions which include, but are 

not limited to, disorders affecting the spleen or immune system, developmental 

disorders, and cancers. Similarly, polypeptides and antibodies directed to these 
_ pojypeptides are useful in-providing immunological probes-for differential - 

identification of the tissue(s) or cell type(s). For a number of disorders of the above 
25 tissues or cells, particularly of the immune system, expression of this gene at 

significantly higher or lower levels may be routinely detected in certain tissues or cell 

types (e.g., spleen, developing, cancerous and wounded tissues) or bodily fluids (e.g., 
lymph, semm,-plasma^urinersynovial-fiuid-and-spinal-fluid)"or"another 

sample taken from an individual having such a disorder, relative to the standard gene 
30 expression level, i.e., the expression level in healthy tissue or bodily fluid from an 

individual not having the disorder. 
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Preferred epitopes include those comprising a sequence shown in SEQ ID NO: 
1 17 as residues: Asn-37 to Lys-44, Ser-73 to Glu-78, Ala- 103 to Ser-1 1 1. 

The tissue distribution in spleen metastic melanoma and embryonic tissues 
indicates that the protein products of this gene are useful for the diagnosis and/or 
treatment of disorders affecting the spleen, including cancers of the spleen, as well as 
cancers of other tissues where expression has been observed. Furthermore, expression 
within embryonic tissue and other cellular sources marked by proliferating cells 
indicates that this protein may play a role in the regulation of cellular division, and 
may show utility in the diagnosis and treatment of cancer and other proliferative 
disorders. Similarly, embryonic development also involves decisions involving cell 
differentiation and/or apoptosis in pattern formation. Thus, this protein may also be 
involved in apoptosis or tissue differentiation and could again be useful in cancer 
therapy. Protein, as well as, antibodies directed against the protein may show utility as 
a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:20 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 975 of SEQ ID NO:20, b is an 
integer of 15 to 989, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:20, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 11 

It has been discovered that this gene is expressed primarily in cells of the 

immune systemrinclucling" monocytes and neutrophils. — 

Therefore, nucleic acids of the invention are useful as reagents for differential 

identification of the tissue(s) or cell type(s) present in a biological sample and for 

diagnosis of the following diseases and conditions: disorders affecting the immune 
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system, such as AIDS. Similarly, polypeptides and antibodies directed to those 
polypeptides are useful to provide immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune system, expression of this gene at significantly higher or 
lower levels may be detected in certain tissues or cell types (e.g., immune, cancerous 
and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial 
fluid or spinal fluid) taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue from an 
individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
1 1 8 as residues: Ser-12 to Asp-20, Gly-22 to Gly-32, Ala-49 to Thr-57. 

The tissue distribution in monocytes and neutrophils indicates that the protein 
products of this clone are useful for the diagnosis and/or treatment of immune system 
disorders, including AIDS. Furthermore, expression of this gene product in 
monocytes and neutrophils suggests a role in the regulation of the proliferation; 
survival; differentiation; and/or activation of potentially all hematopoietic cell 
lineages, including blood stem cells. This gene product may be involved in the 
regulation of cytokine production, antigen presentation, or other processes that may 
also suggest a usefulness in the treatment of cancer (e.g. by boosting immune 
responses). 

Since the gene is expressed in cells of lymphoid origin, the gene or protein, as 
well as, antibodies directed against the protein may show utility as a tumor marker 
and/or immunotherapy targets-for the above-listed tissues." Therefore it may be also ~ 
used as an agent for immunological disorders including arthritis, asthma, immune 
deficiency diseases such as AIDS, leukemia, rheumatoid arthritis, inflammatory 
bowel disease, sepsis, acne, and psoriasis. In addition, this gene product may have 
commercial utility in the expansion of stem cells and committed progenitors of 
-various blood-lineages; and in the differemia^ cell 
types. Expression of this gene product in monocytes and neutrophils also strongly 
suggests a role for this protein in immune function and immune surveillance. Protein, 
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as well as, antibodies directed against the protein may show utility as a tumor marker 
and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:21 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 481 of SEQ ID NO:21, b 
is an integer of 15 to 495, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:21, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 12 

It has been discovered that this gene is expressed primarily in cells of the 
immune system, including monocytes. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: disorders affecting the immune 
system. Similarly, polypeptides and antibodies directed to those polypeptides are 
useful to provide immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the immune system, expression of this gene at significantly higher or lower levels 
may be detected in certain tissues or cell types (e.g., immune, cancerous and wounded 
tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal 
fluid) taken from an individual having such a disorder, relative to the standard gene 
expressionlevelri.e., the~expre~ssidnT^ tissu^fi^nTanTi^ivBual not 

having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
1 19 as residues: Glu-35 to Trp-42. 
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The tissue distribution suggests that the protein product of this clone is useful 
for the diagnosis and treatment of a variety of immune system disorders. Expression 
of this gene product in monocytes suggests a role in the regulation of the 
proliferation; survival; differentiation; and/or activation of potentially all 
5 hematopoietic cell lineages, including blood stem cells. This gene product may be 
involved in the regulation of cytokine production, antigen presentation, or other 
processes that may also suggest a usefulness in the treatment of cancer (e.g. by 
boosting immune responses). 

Since the gene is expressed in cells of lymphoid origin, the gene or protein, as 

10 well as, antibodies directed against the protein may show utility as a tumor marker 
and/or immunotherapy targets for the above listed tissues. Therefore it may be also 
used as an agent for immunological disorders including arthritis, asthma, immune 
deficiency diseases such as AIDS, leukemia, rheumatoid arthritis, inflammatory 
bowel disease, sepsis, acne, and psoriasis. In addition, this gene product may have 

15 commercial utilityTn the expansion of stem cells and committed progenitors of 

various blood lineages, and in the differentiation and/or proliferation of various cell 
types. Expression of this gene product in monocytes also strongly suggests a role for 
this protein in immune function and immune surveillance. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 

20 immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 22 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 

25 excluded from the scope of the present invention. To list every related sequence 

would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b7 where a is anyinteger between T to "2303 "of SEQ II) NO:22, b 
is an integer of 15 to 2317, where both a and b correspond to the positions of 

30 nucleotide residues shown in SEQ ID NO:22, and where b is greater than or equal to a 
+ 14. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 13 

It has been discovered that this gene is expressed primarily in cells of the 
immune system, including monocytes. 
5 Therefore, nucleic acids of the invention are useful as reagents for differential 

identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: disorders of the immune system. 
Similarly, polypeptides and antibodies directed to those polypeptides are useful to 
provide immunological probes for differential identification of the tissue(s) or cell 

10 type(s). For a number of disorders of the above tissues or cells, particularly of the 

immune system, expression of this gene at significantly higher or lower levels may be 
detected in certain tissues or cell types (e.g., immune, cancerous and wounded tissues) 
or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) 
taken from an individual having such a disorder, relative to the standard gene 

15 expression level, i.e/, the expression level in healthy tissue from an individual not 
having the disorder. 

The tissue distribution in monocytes indicates that the protein products of this 
clone are useful for the diagnosis and/or treatment of disorders of the immune system. 
Expression of this gene product in monocytes suggests a role in the regulation of the 

20 proliferation; survival; differentiation; and/or activation of potentially all 

hematopoietic cell lineages, including blood stem cells. This gene product may be 
involved in the regulation of cytokine production, antigen presentation, or other 
processes that may also suggest a usefulness in the treatment of cancer (e.g. by 
boosting immune responses). 

25 Since the gene is expressed in cells of lymphoid origin, the gene or protein, as 

well as, antibodies directed against the protein may show utility as a tumor marker 
and/or immunotherapy targets for the above listed tissues. Therefore it may be also 
used~as~ ^^gentToTimmunologic arthritis^ asthma,~iihmune 

deficiency diseases such as AIDS, leukemia, rheumatoid arthritis, inflammatory 

30 bowel disease, sepsis, acne, and psoriasis. In addition, this gene product may have 
commercial utility in the expansion of stem cells and committed progenitors of 
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various blood lineages, and in the differentiation and/or proliferation of various cell 
types. Expression of this gene product in monocytes also strongly suggests a role for 
this protein in immune function and immune surveillance. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:23 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1712 of SEQ ID NO:23, b 
is an integer of 15 to 1726, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:23, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 14 

The translation product of this gene shares sequence homology with a gene 
from C. elegans of unknown function. 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: TRPITYVLLAG (SEQ ID NO:224). Polynucleotides encoding 
these polypeptides are also encompassed by the invention. The gene encoding the 
disclosed cDN A is thought to reside on chromosome 1 1 . Accordingly, 
polynucleotides related to this invention are useful as a marker in linkage analysis for 
chromosome 11. 

It has been discovered that this gene is expressed primarily in fetal lung, liver, 
- spleen and heart tissues, as well as adult liver, bladder; ^nadmetriMWomal cells, 
synovium, colon cancer, smooth muscle, keratinocytes, and the bone marrow derived 
cell line RS4;11. 
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Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: disorders of the musculo-skeletal 
system, and cancers of the immune system. Similarly, polypeptides and antibodies 
5 directed to those polypeptides are useful to provide immunological probes for 

differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the musculo-skeletal and immune systems, 
expression of this gene at significantly higher or lower levels may be detected in 
certain tissues or cell types (e.g., immune, musculo-skeletal, cancerous and wounded 

10 tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal 
fluid) taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue from an individual not 
having the disorder. 

The tissue distribution in tissues of the immune system indicates that the 

15 protein products oF this ;cloneare useful for treating proliferative disorders of immune 
system precursor cells. Alternatively, the tissue distribution in smooth muscle and 
heart tissue indicates that the protein product of this gene is useful for the diagnosis 
and treatment of conditions and pathologies of the cardiovascular system, such as 
heart disease, restenosis, atherosclerosis, stoke, angina, thrombosis, and wound 

20 healing. Protein, as well as, antibodies directed against the protein may show utility as 
a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:24 and may have been publicly available prior to conception of 

25 the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 

are one or-more polynucleotides comprising a nu 

general formula of a-b, where a is any integer between 1 to 515 of SEQ ID NO:24, b 

30 is an integer of 15 to 529, where both a and b correspond to the positions of 
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nucleotide residues shown in SEQ ID NO:24, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 15 
5 In specific embodiments, polypeptides of the invention comprise the following 

amino acid sequences: GTSLTAPLLEFLLALYFLFADAMQLNDKWQGLCWP 
(SEQ ID NO:225). Polynucleotides encoding these polypeptides are also 
encompassed by the invention. 

It has been discovered that this gene is expressed primarily in T-cells, fetal 
10 spleen and infant brain tissues, and to a lesser extent in many other tissues including 
melanocytes, lung cancer, macrophages, dendritic cells, stromal cells, adrenal gland 
and others. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 

15 diagnosis of the following diseases; and conditions: inflammation and autoimmunity, 
developing tissues. Similarly, polypeptides and antibodies directed to those 
polypeptides are useful to provide immunological probes for differential identification 
of the ussue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune and developing system, expression of this gene at 

20 significantly higher or lower levels may be detected in certain tissues or cell types 
(e.g., immune, developing, cancerous and wounded tissues) or bodily fluids (e.g., 
lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken from an individual 
having such a disorder, relative to the standard gene expression level, i.e., the 
expression level in healthy tissue from an individual not having the disorder. 

25 Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 

122 as residues: Ser-46 to Gly-5 1 . 

The tissue distribution in T-cells and other immune cells indicates that the 

protein products of this clone are useful for-treating diseases in volving"the~activation~ 

of T-cells, including inflammation and autoimmune diseases. Alternatively, the tissue 

30 distribution in a wide range of fetal tissues suggests that this protein may play a role 
in the regulation of cellular division, and may show utility in the diagnosis and 
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treatment of cancer and other proliferative disorders. Similarly, fetal development 
also involves decisions involving cell differentiation and/or apoptosis in pattern 
formation. Thus, this protein may also be involved in apoptosis or tissue 
differentiation and could again be useful in cancer therapy. Protein, as well as, 
5 antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:25 and may have been publicly available prior to conception of 

10 the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1741 of SEQ ID NO:25, b 

15 is an integer of 15 to 1755, where both" a and b correspond to the positions of 

nucleotide residues shown in SEQ ID NO:25, and where b is greater than or equal to a 
+ 14. 

' FEATURES OF PROTEIN ENCODED BY GENE NO: 16 

20 In specific embodiments, polypeptides of the invention comprise the following 

amino acid sequences: LANFZCSDCAQTVLFVLZFZILVFTYEIPF (SEQ ID 
NO: 226). Polynucleotides encoding these polypeptides are also encompassed by the 
invention. The gene encoding.the disclosed cDNA-is thought to reside on - - 
chromosome 13. Accordingly, polynucleotides related to this invention are useful as a 

25 marker in linkage analysis for chromosome 13. Recently another group published this 
gene, referring to it as CLN5 (See Genbank Accession No.: 3342386). 

It has been discovered that this gene is expressed primarily in placental tissue, 

12_we.ek_embryos,-and-tumors-including-testes^-tongue andpharynxr^'d t^^lesser 

extent in adipose tissue, tonsils, melanocytes, fetal spleen, macrophages, T-cells, 

30 amniotic cells, and brain tissue. 
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Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: tumors, particularly of the tongue 
and throat, and neurodegenerative disorders. Similarly, polypeptides and antibodies 
5 directed to those polypeptides are useful to provide immunological probes for 

differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the neural and digestive systems, expression 
of this gene at significantly higher or lower levels may be detected in certain tissues 
or cell types (e.g., tongue, throat, brain, cancerous and wounded tissues) or bodily 

10 fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
123 as residues: Pro-44 to Ala-60, Val-187 to Thr-193, Lys-203 to Ala-210, Thr-212 

15 toCys-219. " 

The tissue distribution in tongue and pharynx carcinoma tissue indicates that 
the protein products of this clone are useful for diagnosing and/or treating oral 
cancers, including tumors of the throat and tongue. Furthermore, the tissue 
distribution in brain tissue suggests that the protein product of this clone is useful for 

20 the detection/treatment of neurodegenerative disease states and behavioural disorders 
such as neuronal ceroid lipofuscinoses (NCLs), Alzheimers Disease, Parkinsons 
Disease, Huntingtons Disease, Tourette Syndrome, schizophrenia, mania, dementia, 
paranoia, obsessive compulsive disorder, panic disorder, learning disabilities, ALS, 
psychoses, autism, and altered behaviors, including disorders in feeding, sleep 

25 patterns, balance, and perception. In addition, the gene or gene product may also play 
a role in the treatment and/or detection of developmental disorders associated with the 
developing embryo, or sexually-linked disorders. Protein, as well as, antibodies 

directedagainst the protein may show utility ^ 

immunotherapy targets for the above listed tissues. 

30 Many polynucleotide sequences, such as EST sequences, are publicly 

available and accessible through sequence databases. Some of these sequences are 
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related to SEQ ID NO:26 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
5 are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1737 of SEQ ID NO:26, b 
is an integer of 15 to 175 1 , where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:26, and where b is greater than or equal to a 
+ 14. 

10 

FEATURES OF PROTEIN ENCODED BY GENE NO: 17 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: 

QAWHEVGGGVRRCWFVLGERRAGSLLSASYGTFAMPG 

1 5 MVLFGRRW AIASDDLVFPGFFiELVVRVLWWIGILTLYL (SEQ ID NO:227), 

and/or PGMVLFGRRWAIASDDLVFPGFFELVVRVLWWIGILTLYLMHRGKLD 
CAGGALLSSYLIVLMILLAVVICTVSAIMCVSMRGTICNPGPRKSMSKLLYIRL 
ALFFPEMVWASLGAAWVADGVQCD (SEQ ID NO:228). Polynucleotides 
encoding these polypeptides are also encompassed by the invention. 

20 It has been discovered that this gene is expressed in activated neutrophils, 

infant brain tissue and primary dendritic cells. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: disorders of the immune system, 

25 and neurodegenerative disorders. Similarly, polypeptides and antibodies directed to 
those polypeptides are useful to provide immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the immune and neural systems, expression of this 
gene at significantly higher or lower levels may be detected in certain tissues or cell 

30 types (e.g., immune, brain, cancerous and wounded tissues) or bodily fluids (e.g., 

lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken from an individual 
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having such a disorder, relative to the standard gene expression level, i.e., the 
expression level in healthy tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
124 as residues: Pro-47 to Met-53, Serrl30 to Ser-138. 
5 The tissue distribution in neutrophils and primary dendritic cells indicates that 

the protein products of this clone are useful for diagnosing and/or treating immune 
system disorders. Expression of this gene product in neutrophils and primary dendritic 
cells suggests a role in the regulation of the proliferation; survival; differentiation; 
and/or activation of potentially all hematopoietic cell lineages, including blood stem 

10 cells. This gene product may be involved in the regulation of cytokine production, 
antigen presentation, or other processes that may also suggest a usefulness in the 
treatment of cancer (e.g. by boosting immune responses). 

Since the gene is expressed in cells of lymphoid origin, the gene or protein, as 
well as, antibodies directed against the protein may show utility as a tumor marker 

15 and/or immunotherapy targets. for. the a~bove listed tissues. Therefore it may be also 
used as an agent for immunological disorders including arthritis, asthma, immune 
deficiency diseases such as AIDS, leukemia, rheumatoid arthritis, inflammatory 
bowel disease, sepsis, acne, and psoriasis. In addition, this gene product may have 
commercial utility in the expansion of stem cells and committed progenitors of 

20 various blood lineages, and in the differentiation and/or proliferation of various cell 
types. Expression of this gene product in neutrophils and primary dendritic cells also 
strongly suggests a role for this protein in immune function and immune surveillance. 

Alternatively, the tissue distribution in brain tissue suggests that the protein 
product of this clone is useful for the detection/treatment of neurodegenerative 

25 disease states and behavioural disorders such as Alzheimers Disease, Parkinsons 

Disease, Huntingtons Disease, Tourette Syndrome, schizophrenia, mania, dementia, 
paranoia, obsessive compulsive disorder, panic disorder, learning disabilities, ALS, 

psyehosesv autism,andaltered behaviorsrineluding disorders in feediifgrsleep 

patterns, balance, and perception. In addition, the gene or gene product may also play 

30 a role in the treatment and/or detection of developmental disorders associated with the 
developing embryo, or sexually-linked disorders. Protein, as well as, antibodies 
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directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:27 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1 198 of SEQ ID NO:27, b 
is an integer of 15 to 1212, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:27, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 18 

It has been discovered that this gene is expressed primarily in neutrophils, and 
to a lesser extent in other tissues. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: immune and inflammatory 
disorders. Similarly, polypeptides and antibodies directed to those polypeptides are 
useful to provide immunological probes for differential identification of the tissue(s) 
- or cell type(s). For a number of disorders of the above tissues or cells, particularly of — 
the immune system, expression of this gene at significantly higher or lower levels 
may be detected in certain tissues or cell types (e.g., immune, cancerous and wounded 
tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal 
fluid) taken from an individual having such a disorder, relative to the standard gene 
expression level; i:e7, the^xpre"ssio^ 
having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
125 as residues: Gln-17 to Ser-24. 
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The tissue distribution in neutrophils indicates that the protein products of this 
clone are useful for the diagnosis and/or treatment of immune and inflammatory 
disorders. Expression of this gene product in neutrophils suggests a role in the 
regulation of the proliferation; survival; differentiation; and/or activation of 
potentially all hematopoietic cell lineages, including blood stem cells. This gene 
product may be involved in the regulation of cytokine production, antigen 
presentation, or other processes that may also suggest a usefulness in the treatment of 
cancer (e.g. by boosting immune responses). 

Since the gene is expressed in cells of lymphoid origin, the gene or protein, as 
well as, antibodies directed against the protein may show utility as a tumor marker 
and/or immunotherapy targets for the above listed tissues. Therefore it may be also 
used as an agent for immunological disorders including arthritis, asthma, immune 
deficiency diseases such as AIDS, leukemia, rheumatoid arthritis, inflammatory 
bowel disease, sepsis, acne, and psoriasis. In addition, this gene product may have 
commercial utility In the expansion of stem cells and committed progenitors of 
various blood lineages, and in the differentiation and/or proliferation of various cell 
types. Expression of this gene product in neutrophils also strongly suggests a role for 
this protein in immune function and immune surveillance. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:28 and may have been publicly available prior to conception- of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b ) -where-a-is-any-integer-between4-to-1098 of SEQ ID NO:28, b - 
is an integer of 15 to 1112, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:28, and where b is greater than or equal to a 
+ 14. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 19 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: HERNCFPM WLNHS AFPP V (SEQ ID NO:229). 
5 Polynucleotides encoding these polypeptides are also encompassed by the invention. 

It has been discovered that this gene is expressed primarily in neutrophils, and 
to a lesser extent in other tissues. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
10 diagnosis of the following diseases and conditions: immune and inflammatory 

disorders. Similarly, polypeptides and antibodies directed to those polypeptides are 
useful to provide immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the immune system, expression of this gene at significantly higher or lower levels 
15 may be detected in "certain tissues or cell types (e.g., immune, cancerous and wounded 
tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal 
fluid) taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue from an individual not 
having the disorder. 

20 The tissue distribution in neutrophils indicates that the protein products of this 

clone are useful for the diagnosis and/or treatment of immune and inflammatory 

disorders. Expression of this gene product in neutrophils suggests a role in the 
regulation of the proliferation; survival; differentiation; and/or activation"of 

potentially all hematopoietic cell lineages, including blood stem cells. This gene 
25 product may be involved in the regulation of cytokine production, antigen 

presentation, or other processes that may also suggest a usefulness in the treatment of 

cancer (e.g. by boosting immune responses). 
Since-the-gene is expressedin "cells""of lympfioia^origin,1he gene orprblein, as 

well as, antibodies directed against the protein may show utility as a tumor marker 
30 and/or immunotherapy targets for the above listed tissues. Therefore it may be also 

used as an agent for immunological disorders including arthritis, asthma, immune 
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deficiency diseases such as AIDS, leukemia, rheumatoid arthritis, inflammatory 
bowel disease, sepsis, acne, and psoriasis. In addition, this gene product may have 
commercial utility in the expansion of stem cells and committed progenitors of 
various blood lineages, and in the differentiation and/or proliferation of various cell 
5 types. Expression of this gene product in neutrophils also strongly suggests a role for 
this protein in immune function and immune surveillance. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 

10 available and accessible through sequence databases. Some of these sequences are 

related to SEQ ID NO:29 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 

15 are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 734 of SEQ ID NO:29, b 
is an integer of 15 to 748, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:29, and where b is greater than or equal to a 
+ 14. 

20 

FEATURES OF PROTEIN ENCODED BY GENE NO: 20 

In specific embodiments, polypeptides of the invention comprise the following 

amino acid sequences: GWTRENDHRALSKAGIGSAEIQPSNLRVGSAKDLGKPW 

AGKLLLLSSCLLFFSLGVLYRGQMLAPPLQEDWKGGVKDSDLIDDSSASPIPP 
25 SYLEYKAALYPFSEHKSVRNATDSLTFFLVTDHFLDNQDSQ (SEQ ID 

NO:230), GWTRENDHRALSKAiGIGSAEIQPSNLRVGSAKDLGKPWAGKLLLL 

(SEQIDNO:231), 
SSGLLFFSLGVtYRGQM^ 

NO:232), and/or S YLE YKA AL YPFSEHKS VRN ATDS LTFFL VTDHFL DNQDSQ 
30 (SEQ ID NO:233). Polynucleotides encoding these polypeptides are also 

encompassed by the invention. 
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It has been discovered that this gene is expressed primarily in ovarian cancer 
tissue, and to a lesser extent in other tissues. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
5 diagnosis of the following diseases and conditions: ovarian cancer. Similarly, 
polypeptides and antibodies directed to those polypeptides are useful to provide 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the ovaries, 
expression of this gene at significantly higher or lower levels may be detected in 
10 certain tissues or cell types (e.g., reproductive, cancerous and wounded tissues) or 
bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken 
from an individual having such a disorder, relative to the standard gene expression 
level, i.e., the expression level in healthy tissue from an individual not having the 
disorder. 

15 Preferred epitopes include, those comprising a sequence shown in SEQ ID NO. 

127 as residues: Thr-20 to Gly-27, Gly-32 to Phe-41 . 

The tissue distribution in ovarian cancer tissue indicates that the protein 
products of this clone are useful for the diagnosis and/or treatment of ovarian cancer, 
as well as cancers of other tissues where expression has been observed. Protein, as 

20 well as, antibodies directed against the protein may show utility as a tumor marker 
and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these 
related to SEQ ID NO:30 and may have been publicly available prior to conception of 

25 the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleoddes comprisi^ 

general formula of a-b, where a is any integer between 1 to 764 of SEQ ID NO:30, b 
30 is an integer of 15 to 778, where both a and b correspond to the positions of 
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nucleotide residues shown in SEQ ID NO:30, and where b is greater than or equal to a 
'+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 21 
5 When tested against U937 Myeloid cell lines, supernatants removed from cells 

containing this gene activated the GAS assay. Thus, it is likely that this gene activates 
myeloid cells, and to a lesser extent other cells, through the Jak-STAT signal 
transduction pathway. The gamma activating sequence (GAS) is a promoter element 
found upstream- of many genes which are involved in the Jak-STAT pathway. The 

10 Jak-STAT pathway is a large, signal transduction pathway involved in the 

differentiation and proliferation of cells. Therefore, activation of the Jak-STAT 
pathway, reflected by the binding of the GAS element, can be used to indicate 
proteins involved in the proliferation and differentiation of cells. 

In specific embodiments, polypeptides of the invention comprise the following 

15 amino acid sequences: LKFHQESLSGD (SEQ ID NO:234). Polynucleotides 
encoding these polypeptides are also encompassed by the invention. 

It has been discovered that this gene is expressed primarily in fast-growing 
tissues such as immune/hematopoietic tissues, early developmental stage human 
tissues, and tumor tissues, and to a lesser extent in some other tissues. 

20 Therefore, nucleic acids of the invention are useful as reagents for differential 

identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: growth disorders, immune and 
inflammatory diseases, and tumori genesis. Similarly, polypeptides and antibodies 
directed to those polypeptides are useful to provide immunological probes for 

25 differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the immune/hematopoietic system, 
expression of this gene at significantly higher or lower levels may be detected in 
c"eitairrtissues^r"ceir types (e.g. rimmune7c~arKerous M^^undedlissues^r bodily 
fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken from an 

30 individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue from an individual not having the disorder. 
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Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
128 as residues: Glu-60 to Arg-65. 

The tissue distribution in immune tissues, in conjunction with the biological 
activity data, indicates that the protein products of this clone are useful for the 
diagnosis and/or treatment of growth disorders, immune and inflammatory diseases, 
and tumorigenesis. Furthermore, expression within embryonic tissue and other 
cellular sources marked by proliferating cells suggests that this protein may play a 
role in the regulation of cellular division, and may show utility in the diagnosis and 
treatment of cancer and other proliferative disorders. Similarly, embryonic 
development also involves decisions involving cell differentiation and/or apoptosis in 
pattern formation. Thus, this protein may also be involved in apoptosis or tissue 
differentiation and could again be useful in cancer therapy. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:31 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1310 of SEQ ID NO:31, b 
is an integer of 1 5 to 1 324, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:31, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 22 

In specific embodiments,-polypeptides of the invention comprise the'follbwing" 

amino acid sequences: EAKSRPVTQAGVQWHDLGSLQPLPP (SEQ ID NO:235). 
Polynucleotides encoding these polypeptides are also encompassed by the invention. 
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It has been discovered that this gene is expressed primarily in ovarian cancer 
tissue, and to a lesser extent in fetal liver/spleen and retinal tissues. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: ovarian cancer, immune disorders, 
and retinal disorders. Similarly, polypeptides and antibodies directed to those 
polypeptides are useful to provide immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the ovaries, immune and ocular systems, expression of this gene at 
significantly higher or lower levels may be detected in certain tissues or cell types 
(e.g., reproductive, ovaries, retina, immune, cancerous and wounded tissues) or bodily 
fluids (e.g;, lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue from an individual not having the disorder. 

The tissue distribution in; ovarian cancer tissue indicates that the protein 
products of this clone are useful for the diagnosis and/or treatment of ovarian cancer, 
as well as cancers of other tissues where expression has been observed. The tissue 
distribution also suggests that the protein product of this clone is useful for the 
diagnosis and treatment of a variety of immune system disorders. Expression of this 
gene product in fetal liver/spleen suggests a role in. the regulation of the proliferation; 
survival; differentiation; and/or activation of potentially all hematopoietic cell 
lineages, including blood stem cells. This gene product may be involved in the 
regulation of cytokine production, antigen presentation, or other processes that may 
also suggest a usefulness in the treatment of cancer (e.g. by boosting immune 
responses). 

Since the gene is expressed in cells of lymphoid origin, the gene or protein, as 
well as, antibodies directed against the protein may show utility as a tumor marker 

and/or_immunotherapy targets-for-the above listed tissues. Therefore it may be~also 

used as an agent for immunological disorders including arthritis, asthma, immune 
deficiency diseases such as AIDS, leukemia, rheumatoid arthritis, inflammatory 
bowel disease, sepsis, acne, and psoriasis. In addition, this gene product may have 
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commercial utility in the expansion of stem cells and committed progenitors of 
various blood lineages, and in the differentiation and/or proliferation of various cell 
types. Alternatively, the tissue distribution in retinal tissue suggests that the protein 
product of this clone is useful for the treatment and/or detection of eye disorders 
including blindness, color blindness, impaired vision, short and long sightedness, 
retinitis pigmentosa, retinitis proliferans, and retinoblastoma, retinochoroiditis, 
retinopathy and retinoschisis. Protein, as well as, antibodies directed against the 
protein may show utility as a tumor marker and/or immunotherapy targets for the 
above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:32 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 725 of SEQ ID NO: 32, b 
is an integer of 15 to 739, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:32, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 23 

The translation product of this gene shares sequence homology with a C. — 
elegans protein of unknown function (See Genbank Accession No.: 
gnllPIDIel348017). When tested against fibroblast cell lines, supernatants removed 
from cells containing this gene activated the EGR1 assay. Thus, it is likely that this 
gene activates fibroblast cells through a signal transduction pathway. Early growth 

response^ 1 (EGR 1 ) is a .promoter associated, with-certain genes-that-induees various 

tissues and cell types upon activation, leading the cells to undergo differentiation and 
proliferation. The gene encoding the disclosed cDNA is thought to reside on 
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chromosome 17. Accordingly, polynucleotides related to this invention are useful as a 
marker in linkage analysis for chromosome 17. 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: EAKSRPVTQAGVQWHDLGSLQPLPP (SEQ ID NO:236), 
and/or ALVLVCRQRYCRPRDLLQRYDSKPIVDLIGAMETQSEPSELELDDVVIT 
NPHIEAILENEDWIEDASGLMSHCIAILKICHTLTEKLVAMTMGSGAKMKTSA 
SVSDIIVVAKRISPRVDDVVKSMYPPLDPKLLDAR (SEQ ID NO:237). 
Polynucleotides encoding these polypeptides are also encompassed by the invention. 

It has been discovered that this gene is expressed primarily in fast growing 
tissues such as early development stage human tissues, immune/hematopoietic 
tissues, melanocytes, and tumor tissues, and to a lesser extent in some other tissues. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: growth disorders, immune and 
inflammatory disoders, skin and; connective tissue disorders, and tumori genesis. 
Similarly, polypeptides and antibodies directed to those polypeptides are useful to 
provide immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the fast 
growing tissues such as early development stage human tissues, 
immune/hematopoietic tissues, skin and connective tissue, and tumor tissues, 
expression of this gene at significantly higher or lower levels may be detected in 
certain tissues or cell types (e.g., musculoskeletal, skin, immune, developing, 
cancerous and wounded tissues) or bodily fluids „(e ! g., lymph r serum, plasma, urine, - - 
synovial fluid or spinal fluid) taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy 
tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 

130 as residues^Pro -34 to Ser-4 3, Gh>54jo„Serr60 - 

The tissue distribution suggests that the protein product of this clone is useful 
for the diagnosis and/or treatment of growth disorders, immune and inflammatory 
disorders, and tumorigenesis. Alternatively, the tissue distribution in melanocytes, in 
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conjunction with the observed biological activity data, suggests that the protein 
product of this clone is useful for the treatment, diagnosis, and/or prevention of 
various skin disorders including congenital disorders (i.e. nevi, moles, freckles, 
Mongolian spots, hemangiomas, port-wine syndrome), integumentary tumors (i.e. 
5 keratoses, Bowen's disease, basal cell carcinoma, squamous cell carcinoma, 

malignant melanoma, Paget' s disease, mycosis fungoides, and Kaposi's sarcoma), 
injuries and inflammation of the skin (i.e.wounds, rashes, prickly heat disorder, 
psoriasis, dermatitis), atherosclerosis, uticaria, eczema, photosensitivity, autoimmune 
disorders (i.e. lupus erythematosus, vitiligo, dermatomyositis, morphea, scleroderma, 
10 pemphigoid, and pemphigus), keloids, striae, erythema, petechiae, purpura, and 
xanthelasma. 

Moreover, such disorders may predispose increased susceptibility to viral and 
bacterial infections of the skin (i.e. cold sores, warts, chickenpox, molluscum 
contagiosum, herpes zoster, boils, cellulitis, erysipelas, impetigo, tinea, althletes foot, 
15 and ringworm). Protein, as well as,-antrbodies directed against the protein may show 
utility as a tumor marker and immunotherapy targets for the above listed tumors and 
tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 

20 related to SEQ ID NO:33 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 

25 general formula of a-b, where a is any integer between 1 to 1448 of SEQ ID NO:33, b 
is an integer of 15 to 1462, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:33, and where b is greater than or equal to a 
+ 14. [ 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 24 

When tested against U937 Myeloid cell lines, supernatants removed from cells 
containing this gene activated the GAS assay. Thus, it is likely that this gene activates 
myeloid cells, and to a lesser extent other cells, through the Jak-STAT signal 
5 transduction pathway. The gamma activating sequence (GAS) is a promoter element 
found upstream of many genes which are involved in the Jak-STAT pathway. The 
Jak-STAT pathway is a large, signal transduction pathway involved in the 
differentiation and proliferation of cells. Therefore, activation of the Jak-STAT 
pathway, reflected by the binding of the GAS element, can be used to indicate 
10 proteins involved in the proliferation and differentiation of cells 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: 

DVESRGPSARCLPVVPGSLLPGLEPATKLMPGGLAPGHG 

APVRELLLPLLSQPTLGSLWDSLRHCSLLCNPLSCVPALEAPPSLVSLGCSGGC 
15 PRLSLAGSASPFPFLTALLSLLNTLAQIHKGLCGQLAAILAAPGLQNYFLQeVA 
PGAAPHLTPFSAWALRHEYHLQYLALALAQKAAALQPLPATHAALYHGMAL 
ALLSRLLPGSEYLTHELLLSCVFRLEFLPERTSGGPEAADFSDQLSLGSSRVPR 
CGQGTLLAQACQDLPSIRNCYLTHCSPARASLLASQALHRGELQRVPTLLLP 
MPTEPLLPTDWPFLH (SEQ ID N 0:238), 

20 DVESRGPSARCLPVVPGSLLPGLEPATKLM PGGLAPGHGAPVRE (SEQ ID 
NO:239), LLLPLLSQPTLGSLWDSLRHCSLLCNP LSCVPALEAPPSLVSLGC 
(SEQ ID NO:240), S GG CPRLS L AGS AS PFPFLT ALL 

SLLNTLAQIHKGLCGQLAAILA (SEQ ID NO: 241), APGLQNYFLQCVAPGAAP 

HLTPFSAWALRHEYHLQYLALALAQK (SEQ ID NO:242), AAALQPLPATHAA 
25 LYHGMALALLSRLLPGSEYLTHELLLSCVFR (SEQ ID NO:243), LEFLPERTSG 
GPEAADFSDQLSLGSSRVPRCGQGTLLAQACQDL (SEQ ID NO:244), and/or 
PSIRNCYLTHCSPARASLLASQALHRGELQRVPTLLLPMPTEPLLPTDWPFLH 

(SEQ ID— NO:245)r- Polynucleotides- encoding— these~polypeptides~are~also — ~~ 

encompassed by the invention. 
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It has been discovered that this gene is expressed primarily in hematopoietic 
tissues and fetal heart tissue, and to a lesser extent in brain and gall bladder tissues, 
and some other tissues. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: immune and inflammatory 
disorders, cardiovascular disorders, and growth disorders. Similarly, polypeptides and 
antibodies directed to those polypeptides are useful to provide immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the hematopoietic and vascular systems, 
expression of this gene at significantly higher or lower levels may be detected in 
certain tissues or cell types (e.g., vascular, immune, cancerous and wounded tissues) 
or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) 
taken from an individual having such a disorder, relative to the standard gene 
expression level, i.er, the expression level in healthy tissue from an individual not 
having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
131 as residues: Tyr-88 to Trp-102, Asp-105 to Ser-110. 

The tissue distribution in hematopoietic tissues, in conjunction with the 
observed biological activity data, indicates that the protein products of this clone are 
useful for the diagnosis and/or treatment of immune and inflammatory disorders and 
growth disorders. Alternatively, the tissue distribution in fetal heart tissue indicates 
that the protein product of this gene is useful for the diagnosis and treatment of 
conditions and pathologies of the cardiovascular system, such as heart disease, 
restenosis, atherosclerosis, stoke, angina, thrombosis, and wound healing. Protein, as 
well as, antibodies directed against the protein may show utility as a tumor marker 
and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequencesrsuch as EST sequencesrare publicly — 

available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:34 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
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excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 2801 of SEQ ID NO:34, b 
5 is an integer of 15 to 2815, where both a and b correspond to the positions of 

nucleotide residues shown in SEQ ID NO:34, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 25 

10 In specific embodiments, polypeptides of the invention comprise the following 

amino acid sequences: VGSVLGAFLTFPGLRLAQTHRDALT (SEQ ID NO:246). 

Polynucleotides encoding these polypeptides are also encompassed by the invention. 

The gene encoding the disclosed cDNA is thought to reside on chromosome 19. 

Accordingly, polynucleotides related to this invention are useful as a marker in 
15 linkage analysis forchromosome 19. " 

It has been discovered that this gene is expressed primarily in human pituitary 

tissue. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 

20 diagnosis of the following diseases and conditions: hyperpituitarism and 

hypopituitarism. Similarly, polypeptides and antibodies directed to those polypeptides 
are useful to provide immunological probes for differential identification of the 
tissue(s).or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the endocrine system, expression of this gene at significantly higher or 

25 lower levels may be detected in certain tissues or cell types (e.g., endocrine, 

cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, 
synovial fluid or spinal fluid) taken from an individual having such a disorder, 

relative to the standard-gene expression-level ri.e.,- the expression level in heal thy ~ 

tissue from an individual not having the disorder. This gene is found on the short arm 

30 of chromosome 19 and, therefore, is useful as a chromosome marker. 
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Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
132 as residues: Met-1 to Pro-6, Gln-89 to Ala-94, Pro- 161 to Cys-173. 

The tissue distribution in pituitary tissue indicates that the protein products of 
this clone are useful for the diagnosis and/or treatment of pituitary disorders. More 
generally, the tissue distribution in pituitary tissue suggests that the protein product of 
this clone is useful for the detection, treatment, and/or prevention of various 
endocrine disorders and cancers, particularly Addison's disease, Cushing's 
Syndrome, and disorders and/or cancers of the pancrease (e.g. diabetes mellitus), 
adrenal cortex, ovaries, pituitary (e.g., hyper-, hypopituitarism), thyroid (e.g. hyper-, 
hypothyroidism), parathyroid (e.g. hyper-, hypoparathyroidism) , hypothalamus, and 
testes. Protein, as well as, antibodies directed against the protein may show utility as a 
tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:35 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1064 of SEQ ID NO:35, b 
is an integer of 15 to 1078, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:35, and where b is greater than or equal to a 
+ 14. - 

FEATURES OF PROTEIN ENCODED BY GENE NO: 26 

It has been discovered that this gene is expressed highly and specifically in 
placental and bone marrow cDNA libraries, and to a lesser extent in T-cells. 

Therefore,-nucleic acids of the invention are usefuras reagents for differential ~ 

identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: immune, developmental and 
reproductive disorders. Similarly, polypeptides and antibodies directed to those 
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polypeptides are useful to provide immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune and developing systems, expression of this gene at 
significantly higher or lower levels may be detected in certain tissues or cell types 
(e.g., immune, developmental, reproductive, cancerous and wounded tissues) or 
bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken 
from an individual having such a disorder, relative to the standard gene expression 
level, i.e., the expression level in healthy tissue from an individual not having the 
disorder. 

The tissue distribution in bone marrow and placental tissue indicates that the 
protein products of this clone are useful for the diagnosis and/or treatment of immune 
and reproductive disorders. The tissue distribution in bone marrow suggests that the 
protein product of this clone is useful for the treatment and diagnosis of 
hematopoietic related disorders such as anemia, pancytopenia, leukopenia, 
thrombocytopenia or leukemia/The us'es include bone marrow cell ex vivo culture, 
bone marrow transplantation, bone marrow reconstitution, radiotherapy or 
chemotherapy of neoplasia. The gene product may also be involved in lymphopoiesis, 
therefore, it can be used in immune disorders such as infection, inflammation, allergy, 
immunodeficiency etc. In addition, this gene product may have commercial utility in 
the expansion of stem cells and committed progenitors of various blood lineages, and 
in the differentiation and/or proliferation of various cell types. 

Alternatively, the tissue distribution in placental tissue suggests that the 
protein product of this clone is useful for the diagnosis and/or treatment of disorders 
of the placenta. Specific expression within the placenta suggests that this gene 
product may play a role in the proper establishment and maintenance of placental 
function. Alternately, this gene product may be produced by the placenta and then 
transported to the embryo, where it may play a crucial role in the development and/or 
- survival of-the-developing embryo or fetus. — 

Expression of this gene product in a vascular-rich tissue such as the placenta 
also suggests that this gene product may be produced more generally in endothelial 
cells or within the circulation. In such instances, it may play more generalized roles in 
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vascular function, such as in angiogenesis. It may also be produced in the vasculature 
and have effects on other cells within the circulation, such as hematopoietic cells. It 
may serve to promote the proliferation, survival, activation, and/or differentiation of 
hematopoietic cells, as well as other cells throughout the body. Protein, as well as, 
5 antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:36 and may have been publicly available prior to conception of 

10 the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1203 of SEQ ID NO:36, b 

15 is an integer of 15 to 1217, where bothra and b correspond to the positions of 

nucleotide residues shown in SEQ ID NO:36, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 27 
20 In specific embodiments, polypeptides of the invention comprise the following 

amino acid sequences: 

LECTDTIMVHCSLKLLSPSDXSHSASQVAKTRGVHHXTQ 
XIFKVFFVXMGSHSTKYXSIRPGLLP (SEQ ID NO:247). Polynucleotides 

encoding these polypeptides are also encompassed by the invention. 
25 It has been discovered that this gene is expressed primarily in human prostate 

and smooth muscle tissues. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identiflcationof the-tissue(s) or cell type(s)-present-in a-biologicalsampleand for- - — 

diagnosis of the following diseases and conditions: disorders in the prostate gland, 
30 vascular and connective tissues. Similarly, polypeptides and antibodies directed to 

those polypeptides are useful to provide immunological probes for differential 
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identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the male reproductive and urinary system and vascular 
system, expression of this gene at significantly higher or lower levels may be detected 
in certain tissues or cell types (e.g., reproductive, vascular, cancerous and wounded 
5 tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal 
fluid) taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue from an individual not 
having the disorder. 

The tissue distribution in prostate and smooth muscle tissues indicates that the 

10 protein products of this clone are useful for the diagnosis and/or treatment of prostate 
gland, vascular and connective tissue disorders. The tissue distribution in smooth 
muscle tissue indicates that the protein product of this gene is useful for the diagnosis 
and treatment of conditions and pathologies of the cardiovascular system, such as 
heart disease, restenosis, atherosclerosis, stoke, angina, thrombosis, and wound 

1 5 healing. The expression in the.prostate'tissue may indicate the gene or its products 

can be used in the disorders of the prostate, including inflammatory disorders, such as 
chronic prostatitis, granulomatous prostatitis and malacoplakia, prostatic hyperplasia 
and prostate neoplastic disorders, including adenocarcinoma, transitional cell 
carcinomas, ductal carcinomas, squamous cell carcinomas, or as hormones or factors 

20 with systemic or reproductive functions. Protein, as well as, antibodies directed 

against the protein may show utility as a tumor marker and/or immunotherapy targets 
for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 

25 related to SEQ ID NO:37 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 

would be cumbersome .-Aeeordinglyrpreferably excluded from the present invention- 
are one or more polynucleotides comprising a nucleotide sequence described by the 

30 general formula of a-b, where a is any integer between 1 to 1268 of SEQ ID NO: 37, b 
is an integer of 15 to 1282, where both a and b correspond to the positions of 
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nucleotide residues shown in SEQ ID NO:37, and where b is greater than or equal to a 
+ 14. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 28 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: ESSFVPPAAHSSLC (SEQ ID NO:248). Polynucleotides 
encoding these polypeptides are also encompassed by the invention. 

It has been discovered that this gene is expressed primarily in human pituitary 

tissue. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: hyperpituitarism and 
hypopituitarism. Similarly, polypeptides and antibodies directed to those polypeptides 
are useful to provide immunological probes for differential identification of the 
tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the endocrine system, expression of this gene at significantly higher or 
lower levels may be detected in certain tissues or cell types (e.g., endocrine, 
cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, 
synovial fluid or spinal fluid) taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy 
tissue from an individual not having the disorder. 

The tissue distribution in pituitary tissue indicates that the protein products of 
this clone are useful for the diagnosis and/or treatment of pituitary gland disorders 
such as hypopituitarism and hyperpituitarism. More generally, the tissue distribution 
in pituitary tissue suggests that the protein product of this clone is useful for the 

detection, treatment, and/or prevention of-various endocrine disorders and cancers, 

particularly Addison's disease, Cushing's Syndrome, and disorders and/or cancers of 
the pancrease (e.g. diabetes mellitus), adrenal cortex, ovaries, pituitary (e.g., hyper-, 
hypopituitarism), thyroid (e.g. hyper-, hypothyroidism), parathyroid (e.g. hyper-, 
hypoparathyroidism) , hypothallamus, and testes. Protein, as well as, antibodies 

direeted-against the-protein-may-show-utility-as-a-tumor-marker-and/or 

immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
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related to SEQ ID NO: 38 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
5 are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 545 of SEQ ID NO:38, b 
is an integer of 15 to 559, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:38, and where b is greater than or equal to a 
+ 14. 

10 

FEATURES OF PROTEIN ENCODED BY GENE NO: 29 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: 

LLPGQQEATQCVEAGAGEGALTPMCPCRQEQFVDLYKEF 

15 EPSLVNSTVYIMAMAIQMAPFAINYKVRPGPCXNIHCLPTQPHPMKPSVPHPH 
RARPSWRACPRTSPWCGVWQFHSWPSLACSSAPRPTSTASLASWTSLWSSS 
WSLPRSCSWTSAWRSWPTASCSSSWGPRS (SEQ ID NO:249), 
LLPGQQEATQCV EAGAGEGALTPMGPCRQEQFVDLYKEFEPSLVN (SEQ ID 
NO:250), STVYIMAMAIQMAPFAINYKVRPGPCXNIHCLPTQPHPMKPSVP 

20 (SEQ ID N O : 2 5 1 ) , 

HPHRARPSWRACPRTSPWCGVWQFHSWPSLACSSAPRPTSTA (SEQ ID 
NO:252), and/or SLASWTSLWSSSWSLPRSCSWTSAWRSWPTASCSSSWG PRS 
(SEQ ID. ..NO.: 25 3). Polynucleotides encoding these polypeptides are also 
encompassed by the invention. 

25 It has been discovered that this gene is expressed primarily in human pituitary 

and breast tissues, and to a lesser extent in endometrial and ovarian cancer tissues. 

Therefore, nucleic acids of the invention are useful as reagents for differential 

identification of-the tissue(s) or eell-type(s)-presentin a biological sample and for 

diagnosis of the following diseases and conditions: hyperpituitarism and 

30 hypopituitarism, and cancers of the female reproductive system. Similarly, 

polypeptides and antibodies directed to those polypeptides are useful to provide 
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immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the endocrine and 
reproductive systems, expression of this gene at significantly higher or lower levels 
may be detected in certain tissues or cell types (e.g., endocrine, reproductive, 
cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, 
synovial fluid or spinal fluid) taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy 
tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
136 as residues: Ser-3 to Lys-8. 

The tissue distribution in pituitary tissue indicates that the protein products of 
this clone are useful for the diagnosis and/or treatment of disorders in the pituitary 
gland. More generally, the tissue distribution in pituitary tissue suggests that the 
protein product of this clone is useful for the detection, treatment, and/or prevention 
of various endocrine disorders and cancers, particularly Addison's disease, Cushing's 
Syndrome, and disorders and/or cancers of the pancrease (e.g. diabetes mellitus), 
adrenal cortex, ovaries, pituitary (e.g., hyper-, hypopituitarism), thyroid (e.g. hyper-, 
hypothyroidism), parathyroid (e.g. hyper-, hypoparathyroidism) , hypothalamus, and 
testes. Alternatively, the tissue distribution in breast tissue and cancerous tissues of 
the endometrium and ovaries suggests that the translation product of this gene is 
useful for the detection and/or treatment of disorders and cancers of the female 
reproductive system, as well as cancers of other tissues where expression has been 
observed. Protein, as well.as, antibodies directed against the protein may show utility 
as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:39 and may have been publicly available prior to conception of 

_ the present Jnvention.-Preferably,-sueh-reIated-polyn^ 

excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
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general formula of a-b, where a is any integer between 1 to 789 of SEQ ID NO:39, b 
is an integer of 15 to 803, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:39, and where b is greater than or equal to a 
+ 14. 

5 

FEATURES OF PROTEIN ENCODED BY GENE NO: 30 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: TRNILSFIKCVIHNFWIPKESNEITIIINPYRETVCFSVEP 
VKKIFNY (SEQ ID NO:254). Polynucleotides encoding these polypeptides are also 
10 encompassed by the invention. 

It has been discovered that this gene is expressed primarily in human synovial 
sarcoma tissue. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample. Similarly, 
15 polypeptides and antibodies directed to" those polypeptides are useful to provide 

immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the skeletal system, 
expression of this gene at significantly higher or lower levels may be detected in 
certain tissues or cell types (e.g., skeletal, connective, cancerous and wounded tissues) 
20 or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) 
taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue from an individual not 

..„ having.the disorder — — 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
25 137 as residues: Thr-29 to Pro-34. 

The tissue distribution in synovial sarcoma tissue indicates that the protein 
products of this clone are useful for the diagnosis and/or treatment of diseases of the 

, synovium. -In-addition^the 

Expression of this gene product in synovium suggests a role in the detection 
30 and treatment of disorders and conditions affecting the skeletal system, in particular 
osteoporosis as well as disorders afflicting connective tissues (e.g. arthritis, trauma, 
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tendonitis, chrondomalacia and inflammation), such as in the diagnosis or treatment 
of various autoimmune disorders such as rheumatoid arthritis, lupus, scleroderma, and 
dermatomyositis as well as dwarfism, spinal deformation, and specific joint 
abnormalities as well as chondrodysplasias (ie. spondyloepiphyseal dysplasia 
congenita, familial arthritis, Atelosteogenesis type II, metaphyseal chondrodysplasia 
type Schmid). Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:40 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where.als any Integer between 1 to 1496 of SEQ ID NO:40, b 
is an integer of 15 to 1510, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:40, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 31 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: LVVLFASSNSRYLKYFFLVPLILGSAW (SEQ ID NO:255). 
Polynucleotides encoding these polypeptides are also encompassed by the invention. 

It has been discovered that this gene is expressed primarily in human 
rhabdomyosarcoma and fetal liver/spleen tissues. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 

_ diagnosis-of-the-following-diseases and-conditions:malignant neoplasmsand 

hematopoiesis. Similarly, polypeptides and antibodies directed to those polypeptides 
are useful to provide immunological probes for differential identification of the 
tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
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particularly of the skeletal and immune system, expression of this gene at 
significantly higher or lower levels may be detected in certain tissues or cell types 
(e.g., musculoskeletal, immune, cancerous and wounded tissues) or bodily fluids 
(e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
138 as residues: Gly-29 to Thr-35. 

The tissue distribution in rhabdomyosarcoma and fetal liver/spleen tissues 
indicates that the protein products of this clone are useful for diagnosis and treatment 
of skeletal and immune disorders. The expression in rhabdomyosarcoma tissue 
suggests that the protein product of this clone is useful for the detection, treatment, 
and/or prevention of various muscle disorders, such as muscular dystrophy, 
cardiomyopathy, fibroids, myomas, and rhabdomyosarcomas. Alternatively, 

Expression of this gene producfin fetal liver/spleen tissue suggests a role in 
the regulation of the proliferation; survival; differentiation; and/or activation of 
potentially all hematopoietic cell lineages, including blood stem cells. This gene 
product may be involved in the regulation of cytokine production, antigen 
presentation, or other processes that may also suggest a usefulness in the treatment of 
cancer (e.g. by boosting immune responses). 

Since the gene is expressed in cells of lymphoid origin, the gene or protein, as 
well as, antibodies directed against the protein may show utility as a tumor marker 
and/or immunotherapy targets for the above listed tissues. Therefore it may be also 
used as an agent for immunological disorders including arthritis, asthma, immune 
deficiency diseases such as AIDS, leukemia, rheumatoid arthritis, inflammatory 
bowel disease, sepsis, acne, and psoriasis. In addition, this gene product may have 
commercial utility in the expansion of stem cells and committed progenitors of 
- various blood lineagesrand in the dif ferentiatioirand/or proli f^tibfTdf various cell" 
types. Protein, as well as, antibodies directed against the protein may show utility as a 
tumor marker and/or immunotherapy targets for the above listed tissues. 
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Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:41 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1081 of SEQ ID NO:41, b 
is an integer of 15 to 1095, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:41, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 32 

It has been discovered that this gene is expressed primarily in fibrosarcoma 
tissue. - " " - 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: fibrosarcoma. Similarly, 
polypeptides and antibodies directed to those polypeptides are useful to provide 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the connective 
tissue system, expression of this gene at significantly higher or lower levels may be 

-detected in certain tissues or cell types (e.g., musculoskeletal, cancerous and 

wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or 
spinal fluid) taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue from an individual 
not having the disorder. 

Preferred"epitop a sequence~showh iiTSEQ ID NO. 

139 as residues: Ser-34 to Gln-40, Gly-42 to Glu-48, Tyr-56 to Leu-62. 

The tissue distribution in only fibrosarcoma tissue suggests that the protein 
product of this clone is useful for the treatment, diagnosis and/or prognosis of 
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fibrosarcoma^ or other diorders related with fibrous tissue including fibroma, 
fibromatosis, fibromyoma, fibromyositis, fibrosis and fibrositis. Likewise, the 
expression in fibrosarcoma tissue suggests that the protein product of this clone is 
useful for the detection, treatment, and/or prevention of various muscle disorders, 
5 such as muscular dystrophy, cardiomyopathy, myomas, and rhabdomyosarcomas. 

Protein, as well as, antibodies directed against the protein may show utility . as a tumor 
marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 

10 related to SEQ ID NO:42 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 

15 general formula of a-b, where a is. any integer between 1 to 11 48 of SEQ ID NO:42, b 
is an integer of 15 to 1 162, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:42, and where b is greater than or equal to a 
+ 14. 

20 FEATURES OF PROTEIN ENCODED BY GENE NO: 33 

It has been discovered that this gene is expressed primarily in Hodgkins 
lymphoma and breast cancer tissues, and to a lesser extent in stromal cells and brain 
tissue. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
25 identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: lymphoma, breast cancer, and 
neurological disorders. Similarly, polypeptides and antibodies directed to those 

-polypeptides are useful-to-provide immunological probes for differential identification" 

of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
30 particularly of the immune amd nervous systems, expression of this gene at 

significantly higher or lower levels may be detected in certain tissues or cell types 
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(e.g., immune, neural, cancerous and wounded tissues) or bodily fluids (e.g., lymph, 
serum, plasma, urine, synovial fluid or spinal fluid) taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression 
level in healthy tissue from an individual not having the disorder. 
5 Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 

140 as residues: Pro-22 to Lys-29. 

The tissue distribution in Hodgkins lymphoma, brain and breast cancer tissues 
suggests a role in the treatment, diagnosis and/or prognosis of breast cancer, immune 
and hematopoietic disorders including arthritis, asthma, immunodeficiency diseases, 

10 leukemia and Hodgkin's lymphoma and neurodegenerative disease states and 

behavioral disorders such as Alzheimer's Disease, Parkinson's Disease, Huntington's 
Disease, schizophrenia, mania, dementia, paranoia, obsessive compulsive disorder 
and panic disorder. Protein, as well as, antibodies directed against the protein may 
show utility as a tumor marker and/or immunotherapy targets for the above listed 

15 tissues. ~ . " 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:43 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 

20 excluded from the scope of the present invention. To list every related sequence 

would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 643 of SEQ ID NO:43, b 
is an integer of 15 to 657, where both a and b correspond to the positions of 

25 nucleotide residues shown in SEQ ID NO:43, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 34 

In specific embodiments, polypeptides of the invention comprise the following 
30 amino acid sequences: HEWKCKQKYSEGSGNTRIGN (SEQ ID NO:256). 

Polynucleotides encoding these polypeptides are also encompassed by the invention. 
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It has been discovered that this gene is expressed primarily in chronic 
synovitis tissue, and to a lesser extent in fetal kidney and testes tissues. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: synovitis, renal disorders and male 
infertility. Similarly, polypeptides and antibodies directed to those polypeptides are 
useful to provide immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the connective tissue system, the renal system, and the male reproductive system, 
expression of this gene at significantly higher or lower levels may be detected in 
certain tissues or cell types (e.g., skeletal, renal, reproductive, cancerous and wounded 
tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal 
fluid) taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue from an individual not 
having the disorder" 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
141 as residues: Met-33 to Pro-39, Ser-74 to Trp-79. 

The tissue distribution of this gene in chronic synovitis, testes, and kidneys 
suggests a role in the treatment, diagnosis and prognosis of synovial membrane 
disorders including synovitis, renal disorders including kidney failure, renal colic, 
renal diabetes, hypertension, osteodystrophy, tubular acidosis and kidney stones; and 
and male infertility. Furthermore, the tissue distribution in testes tissue indicates that 
the protein product of this clone is useful for the treatment and/or diagnosis of 
conditions concerning proper testicular function (e.g. endocrine function, sperm 
maturation), as well as cancer. Therefore, this gene product is useful in the treatment 
of male infertility and/or impotence. This gene product is also useful in assays 
designed to identify binding agents, as such agents (antagonists) are useful as male 
contraceptive~agents7Simi^ 

and/or diagnosis of testicular cancer. The testes are also a site of active gene 
expression of transcripts that may be expressed, particularly at low levels, in other 
tissues of the body. Therefore, this gene product may be expressed in other specific 
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tissues or organs where it may play related functional roles in other processes, such as 
hematopoiesis, inflammation, bone formation, and kidney function, to name a few 
possible target indications. In addition, the 

Expression of this gene product in synovium suggests a role in the detection 
and/or treatment of disorders and conditions affecting the skeletal system, in 
particular osteoporosis as well as disorders afflicting connective tissues (e.g. arthritis, 
trauma, tendonitis, chrondomalacia and inflammation), such as in the diagnosis or 
treatment of various autoimmune disorders such as rheumatoid arthritis, lupus, 
scleroderma, and dermatomyositis as well as dwarfism, spinal deformation, and 
specific joint abnormalities as well as chondrodysplasias (ie. spondyloepiphyseal 
dysplasia congenita, familial arthritis, Atelosteogenesis type II, metaphyseal 
chondrodysplasia type Schmid). Protein, as well as, antibodies directed against the 
protein may show utility as a tumor marker and/or immunotherapy targets for the 
above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:44 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1 141 of SEQ ID NO:44, b 
is an integer of 15 to 1 155, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:44, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 35 

In specific embodiments, polypepddes~oFthe invention comprise the^following 
amino acid sequences: LLPLCFLGPRQVLEEFPSIV (SEQ ID NO:257). 
Polynucleotides encoding these polypeptides are also encompassed by the invention. 



WO 99/47540 



PCT/US99/05804 



66 

It has been discovered that this gene is- expressed primarily in brain tissue, and 
to a lesser extent in osteoclastoma and testes tissues. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: neurological disorders and male 
reproductive disorders. Similarly, polypeptides and antibodies directed to those 
polypeptides are useful to provide immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the nervous system and the male reproductive system, expression of 
this gene at significantly higher or lower levels may be detected in certain tissues or 
cell types (e.g., neural, reproductive, cancerous and wounded tissues) or bodily fluids 
(e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue from an individual not having the disorder. 

The tissue distribution of this gene in brain tissue suggests a role in the 
diagnosis, prognosis and/or treatment of neurodegenerative disease states and 
behavioural disorders such as Alzheimer's Disease, Parkinson's Disease, Huntinton's 
Disease, schizophrenia, mania, dementia, paranoia, obsessive compulsive disorder 
and panic disorder. In addition, the gene or gene product may also play a role in the 
treatment and/or detection of developmental disorders associated with the developing 
embryo, or sexually-linked disorders. Protein, as well as, antibodies directed against 
the protein may show utility as a tumor marker and/or immunotherapy targets for the 
above listed tissues; - _ 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:45 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the~s^ 

would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1098 of SEQ ID NO:45, b 
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is an integer of 15 to 1 1 12, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:45, and where b is greater than or equal to a 
+ 14. 

5 FEATURES OF PROTEIN ENCODED BY GENE NO: 36 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: PTRPS KHQE AGS (SEQ ID NO:258). Polynucleotides 
encoding these polypeptides are also encompassed by the invention. The gene 
encoding the disclosed cDNA is thought to reside on chromosome 3. Accordingly, 
10 polynucleotides related to this invention are useful as a marker in linkage analysis for 
chromosome 3. 

It has been discovered that this gene is expressed primarily in adult and fetal 
heart tissue, and to a lesser extent in fetal lung and fetal liver/spleen tissues. 

Therefore, nucleic acids of the invention are useful as reagents for differential 

15 identification of thelissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: cardiovascular and immune 
disorders. Similarly, polypeptides and antibodies directed to those polypeptides are 
useful to provide immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 

20 the vascular and immune systems, expression of this gene at significantly higher or 
lower levels may be detected in certain tissues or cell types (e.g., vascular, immune, 
pulmonary, cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, 
plasma, urine, synovial fluid or spinal fluid) taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e., the expression level in 

25 healthy tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
143 as residues: Val-2 to Ser-14. 

Thctissue" distribution in heaftrfetal liver andTetaTspleen tissues^ suggests~a 
role in the treatment and/or diagnosis of cardiovascular disorders including 

30 myocardial infarction, congestive heart failure, coronary failure, as well as immune 
disorders including autoimmune diseases, such as lupus, transplant rejection, allergic 
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reactions, arthritis, asthma, immunodeficiency diseases, leukemia, and AIDS. 
Furthermore, the tissue distribution in adult and fetal heart tissue indicates that the 
protein product of this gene is useful for the diagnosis and treatment of conditions and 
pathologies of the cardiovascular system, such as heart disease, restenosis, 
atherosclerosis, stoke, angina, thrombosis, and wound healing. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:46 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 4009 of SEQ ID NO:46, b 
is an integer of 15 to 4023, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:46, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 37 

It has been discovered that this gene is expressed primarily in testes tissues. 
Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: male infertility and reproductive 
disorders. Similarly, polypeptides and antibodies directed to those polypeptides are 
useful to provide immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
- the malereproducti vesy stem, expressionof thislje^^ DFlower ' ~ 
levels may be detected in certain tissues or cell types (e.g., reproductive, cancerous 
and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial 
fluid or spinal fluid) taken from an individual having such a disorder, relative to the 
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standard gene expression level, i.e., the expression level in healthy tissue from an 
individual not having the disorder. 

The tissue distribution in testes tissues suggests a role in the treatment and/or 
diagnosis of male infertility, and testicular disorders including cancer. Furthermore, 
the tissue distribution in testes tissue indicates that the protein product of this clone is 
useful for the treatment and diagnosis of conditions concerning proper testicular 
function (e.g. endocrine function, sperm maturation), as well as cancer. Therefore, 
this gene product is useful in the treatment of male infertility and/or impotence. This 
gene product is also useful in assays designed to identify binding agents, as such 
agents (antagonists) are useful as male contraceptive agents. Similarly, the protein is 
believed to be useful in the treatment and/or diagnosis of testicular cancer. The testes 
are also a site of active gene expression of transcripts that may be expressed, 
particularly at low levels, in other tissues of the body. Therefore, this gene product 
may be expressed in other specific tissues or organs where it may play related 
functional roles in other processes, "such as hematopoiesis, inflammation, bone 
formation, and kidney function, to name a few possible target indications. Protein, as 
well as, antibodies directed against the protein may show utility as a tumor marker 
and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:47 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 528 of SEQ ID NO:47, b 
is an integer of 15 to 542, where both a and b correspond to the positions of 
nucleotide fesidues"sHowfrin TSEQ~ID"N0747randlvhere S is 
+ 14. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 38 

It has been discovered that this gene is expressed primarily in apoptotic T- 
cells, and to a lesser extent in brain tissue. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
5 identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: immune and neurological 
disorders. Similarly, polypeptides and antibodies directed to those polypeptides are 
useful to provide immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 

10 the immune and nervous systems, expression of this gene at significantly higher or 
lower levels may be detected in certain tissues or cell types (e.g., immune, neural, 
cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, 
synovial fluid or spinal fluid) taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy 

15 tissue from an individual: not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
145 as residues: Glu-33 to Tyr-42. 

The tissue distribution in apoptotic T-cells suggests potential roles in the 
treatment and/or diagnosis of immune disorders including of immune and 

20 autoimmune diseases, such as lupus, transplant rejection, allergic reactions, arthritis, 
asthma, immunodeficiency diseases, leukemia, and AIDS. Alternatively, expression 
in brain tissue suggests a role in the treatment and/or diagnosis of neurodegenerative 
_ disease states and behavioural disorders such as Alzheimer's Disease, Parkinson's- 
Disease, Huntinton's Disease, schizophrenia, mania, dementia, paranoia, obsessive 

25 compulsive disorder and panic disorder. Furthermore, the tissue distribution in 
apoptotic T-cells indicates that the translation product of this gene may also be 
involved in apoptosis or tissue differentiation and could again be useful in cancer 

therapy.-Protein,-as well as, antibodies-directedagainsrthe protein"may"show~utilityas~ 

a tumor marker and/or immunotherapy targets for the above listed tissues. 

30 Many polynucleotide sequences, such as EST sequences, are publicly 

available and accessible through sequence databases. Some of these sequences are 
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related to SEQ ID NO:48 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
5 are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1481 of SEQ ID NO:48, b 
is an integer of 15 to 1495, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:48, and where b is greater than or equal to a 
+ 14. 

10 

FEATURES OF PROTEIN ENCODED BY GENE NO: 39 

The translation product of this gene shares sequence homology with 
phosphomannomutase, which is thought to be important in mannose matabolism. 

It has been discovered that this gene is expressed primarily in meningioma and 
15 testis tissues. ~~ . 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: meningioma related diseases. 
Similarly, polypeptides and antibodies directed to those polypeptides are useful to 
20 provide immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
central nervous system, expression of this gene at significantly higher or lower levels 

may be detected in certain tissues or cell types (e.g., neural-cancerous and wounded — 

tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal 
25 fluid) taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue from an individual not 
having the disorder. 

Preferred epitopes include those comprising a-sequence shownin SEQ-ID-NO. 

146 as residues: Ser-33 to Lys-43. 
30 The tissue distribution in meningioma, and the homology to 

phosphomannomutase, suggests that the protein product of this clone is useful for the 
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diagnosis and/or intervention of meningioma related diseases. For example, the gene 
product can be used for preventing microbial infection of the meninges, for imaging 
conjugates, or as a secretory factor as a endocrine with systemic, central or peripheral 
nerve functions. Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many poly ^nucleotide sequences, such a^^ sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:49 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 804 of SEQ ID NO:49, b 
is an integer of 15 to 818, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:49, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 40 

It has been discovered that this gene is expressed primarily in tonsils, 
osteoclastoma and retinoic acid treated teratocarcinoma cells, and to a lesser extent in 
macrophages, female bladder, adipose tissue, myeloid progenitor cells, prostate tissue, 
and number of other tissues and organs. 

Therefore,-nucleic-acids of-the-inventi on-are-useful as reagents for differential - 

identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: tonsils and osteoclast related 
diseases. Similarly, polypeptides and antibodies directed to those polypeptides are 
useful to provide immunological probes for differential identification of the tissue(s) 
or ceII type(s)rFora number of disorders of the ato — 
the immune and bone systems, expression of this gene at significantly higher or lower 
levels may be detected in certain tissues or cell types (e.g., immune, skeletal, 
cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, 
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synovial fluid or spinal fluid) taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy 
tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
147 as residues: Glu-55 to Arg-61, Gln-84 to Ser-92, Ser-99 to Ser-104. 

The tissue distribution in tonsils and osteoclastoma suggests that the protein 
product of this clone is useful for the diagnosis and/or intervention of diseases related 
to tonsils or osteoclasts. For example, tonsillitis, adenoids, peritonsilar abscess, 
neoplasms, or bone related disorders like rickets, abnormalities of bone growth and 
modelling, facture, osteonecrosis, and osteoporosis etc. Expression of this gene 
product in osteoclastoma suggests that it may play a role in the survival, proliferation, 
and/or growth of osteoclasts. Therefore, it may be useful in influencing bone mass in 
such conditions as osteoporosis. 

Alternatively, the expression of this gene product in tonsils suggests a role in 
the regulation of the' proliferation; survival; differentiation; and/or activation of 
potentially all hematopoietic cell lineages, including blood stem cells. This gene 
product may be involved in the regulation of cytokine production, antigen 
presentation, or other processes that may also suggest a usefulness in the treatment of 
cancer (e.g. by boosting immune responses). 

Since the gene is expressed in cells of lymphoid origin, the gene or protein, as 
well as, antibodies directed against the protein may show utility as a tumor marker 
and/or immunotherapy targets for the above listed tissues. Therefore it may be also 
used as an agent for immunological disorders including arthritis, asthma, immune 
deficiency diseases such as AIDS, leukemia, rheumatoid arthritis, inflammatory 
bowel disease, sepsis, acne, and psoriasis. In addition, this gene product may have 
commercial utility in the expansion of stem cells and committed progenitors of 
various blood lineages, and in the differentiation and/or proliferation of various cell 
types: Protein, as" well as; ffltibodies^directed against the proteiriTiiay show"utility as a 
tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
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related to SEQ ID NO:50 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
5 are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1697 of SEQ ID NO:50, b 
is an integer of 15 to 171 1, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO: 50, and where b is greater than or equal to a 
+ H. 

10 

FEATURES OF PROTEIN ENCODED BY GENE NO: 41 

It has been discovered that this gene is expressed primarily in resting T-cells. 
Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 

15 diagnosis of the following diseases and conditions: T-cell -related disorders. Similarly, 
polypeptides and antibodies directed to those polypeptides are useful to provide 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the immune system, 
expression of this gene at significantly higher or lower levels may be detected in 

20 certain tissues or cell types (e.g., immune, cancerous and wounded tissues) or bodily 
fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 

the expression-level in healthy tissue from an individual not having the disorder. 

The tissue distribution in resting T-cells suggests that the protein product of 

25 this clone is useful for the diagnosis and/or intervention of T-cell related disorders, 
such as infection, inflammation, allergy, tissue/organ transplantation, immune 
deficiency etc. Furthermore, the expression of this gene product in T cells also 
stronglysuggestsarolefor this"proteini^ surveillance. 
Protein, as well as, antibodies directed against the protein may show utility as a tumor 

30 marker and/or immunotherapy targets for the above listed tissues. 
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Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:51 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
5 excluded from the scope of the present invention. To list every related sequence 

would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
genera] formula of a-b, where a is any integer between 1 to 735 of SEQ ID NO:51, b 
is an integer of 15 to 749, where both a and b correspond to the positions of 
10 nucleotide residues shown in SEQ ID NO:51 , and where b is greater than or equal to a 
+ 14. 



FEATURES OF PROTEIN ENCODED BY GENE NO: 42 

The translation product of this gene shares weak sequence homology with 

15 Human metastasis suppressor KiSS-1 fragment, which is thought to be important in 
the diagnosis, prevention, staging and/or treatment of cancers, such as melanoma (See 
Accession No. W 1 5789). 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: GQGPAGRWVRRLPCSRRAGGERGPHWGVWAGPQM 

20 SCGLXFGP (SEQ ID NO:259), WRTQGPMVLLWVVTCPATMLTEPQNPHLIGF 
VAYSGPSHTTQPHKYWLLLDGQADPAAAEGPVKJIKAASVVWWPQALRHLS 
LLVHCWEESYEMNIGCQSLWAGGLASSGNGWDLGVAFRRDTCMSSSSLHW 
KEFKYAPGSLHYFALSFVLILTEICLVSSGMGFPQEGKHFSVLGSPDGSLWGR 
DEHVPREFA (SEQ ID NO:2 6 0 ), 

25 WRTQGPMVLLWVVTCPATMLTEPQNPHLIGFVAY SGPSHTTQ (SEQ ID 
NO:261), PHKYWLLLDGQADPAAAEGPVKRKAASVVWW PQALRHLSLL 
(SEQ ID NO:262), VHC WEES YEMNIGCQSLWAGGL ASS GNGW 

— -D-L-G-V-A F R-R-D-T-C M ( S E Q I D" N O : 2 6 3 )- 

SSSSLHWKEFKYAPGSLHYFALSFVLILT EICLVSSGMGFPQEG (SEQ ID 

30 NO:264), and/or KHFSVLGSPDCSLWGRDEHV PREFA (SEQ ID NO:265). 
Polynucleotides encoding these polypeptides are also encompassed by the invention. 
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The gene encoding the disclosed cDNA is thought to reside on chromosome 1. 
Accordingly, polynucleotides related to this invention are useful as a marker in 
linkage analysis for chromosome 1. 

It has been discovered that this gene is expressed primarily in tonsils, 
osteoclastoma and teratocarcinoma tissues, and to a lesser extent in female bladder, 
adipose tissue, myeloid progenitor, prostate tissue, and number of other tissues. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: diseases related to tonsils and 
osteoclasts. Similarly, polypeptides and antibodies directed to those polypeptides are 
useful to provide immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the immune and bone system, expression of this gene at significantly higher or lower 
levels may be detected in certain tissues or cell types (e.g., immune, skeletal, 
cancerous and wouffded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, 
synovial fluid or spinal fluid) taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy 
tissue from an individual not having the disorder. 

The tissue distribution in tonsils and osteoclastoma tissues suggests that the 
protein product of this clone is useful for the diagnosis and/or treatment of diseases 
related to tonsils and osteoclasts. For example, tonsillitis, adenoids, peritonsilar 
abscess, neoplasms, or abnormal growth and modelling of the bone, osteonecrosis, 
osteoporosis, osteodystrophy, osteoclastoma etc. Expression of this gene product in 
osteoclastoma suggests that it may play a role in the survival, proliferation, and/or 
growth of osteoclasts. Therefore, it may be useful in influencing bone mass in such 
conditions as osteoporosis. 

Moreover, the expression of this gene product in tonsils suggests a role in the 
regulation'of the" proliferation^ 

potentially all hematopoietic cell lineages, including blood stem cells. This gene 
product may be involved in the regulation of cytokine production, antigen 
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presentation, or other processes that may also suggest a usefulness in the treatment of 
cancer (e.g. by boosting immune responses). 

Since the gene is expressed in cells of lymphoid origin, the gene or protein, as 
well as, antibodies directed against the protein may show utility as a tumor marker 
5 and/or immunotherapy targets for the above listed tissues. Therefore it may be also 
used as an agent for immunological disorders including arthritis, asthma, immune 
deficiency diseases such as AIDS, leukemia, rheumatoid arthritis, inflammatory 
bowel disease, sepsis, acne, and psoriasis. In addition, this gene product may have 
commercial utility in the expansion of stem cells and committed progenitors of 

10 various blood lineages, and in the differentiation and/or proliferation of various cell 
types. Protein, as well as, antibodies directed against the protein may show utility as a 
tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 

15 related to SEQ IDTSIO:52 andmay.have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 

20 general formula of a-b, where a is any integer between 1 to 1077 of SEQ ID NO:52, b 
is an integer of 15 to 1091, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:52, and where b is greater than or equal to a 
+ 14. 

25 FEATURES OF PROTEIN ENCODED BY GENE NO: 43 

The translation product of this gene shares sequence homology with the 

Drosophila gene "maleless", which is one of four known regulatory loci required for 
increased transcription (dosage~compensation)"of X-linkedgenes (SeeGeribarik 

Accession No.: gil 157906). 
30 It has been discovered that this gene is expressed primarily in normal prostate 

tissue, testes tissue, whole 6-week old embryonic tissue, human colon carcinoma 
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(HCC) cell line, and cerebellum tissue, and to a lesser extent in primary breast cancer, 
activated T-cells, and many other tissues. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: diseases of the prostate or colon, 
or male reproductive disorders. Similarly, polypeptides and antibodies directed to 
those polypeptides are useful to provide immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the prostate or colon carcinoma, and male reproductive 
disorders, expression of this gene at significantly higher or lower levels may be 
detected in certain tissues or cell types (e.g., colon, prostate, reproductive, cancerous 
and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial 
fluid or spinal fluid) taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue from an 
individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
150 as residues: Val-39 to Ala-45. 

The tissue distribution in colon and prostate tissues suggests that the protein 
product of this clone is useful for the diagnosis and/or treatment of prostate disorders 
such as prostatitis, prostatic hyperplasia, prostate cancers, or human colon carcinoma, 
as well as cancers of other tissues where expression has been observed. Alternatively, 
the tissue distribution in testes tissue, in conjunction with the homology to the 
Drosophila maleless gene, suggests that the translation product of this gene is useful - 
for the detection and/or treatment of disorders involving the testes or the transcription 
of X-linked genes. Furthermore, the tissue distribution indicates that the protein 
product of this clone is useful for the treatment and diagnosis of conditions 
concerning proper testicular function (e.g. endocrine function, sperm maturation), as 

-well as cancer^ There forerthis gene-producHs usefuHn the treatment of male — ■ - 

infertility and/or impotence. 

This gene product is also useful in assays designed to identify binding agents, 
as such agents (antagonists) are useful as male contraceptive agents. Similarly, the 
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protein is believed to be useful in the treatment and/or diagnosis of testicular cancer. 
The testes are also a site of active gene expression of transcripts that may be 
expressed, particularly at low levels, in other tissues of the body. Therefore, this gene 
product may be expressed in other specific tissues or organs where it may play related 
functional roles in other processes, such as hematopoiesis, inflammation, bone 
formation, and kidney function, to name a few possible target indications. Protein, as 
well as, antibodies directed against the protein may show utility as a tumor marker 
and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 53 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 2240 of SEQ ID NO:53, b 
is an integer of 15 to 2254, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:53, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 44 

The translation product of this gene shares weak sequence homology with 
Eimeria antigen Eam45 M3, which is thought to be important in uses as a vaccine for 
protecting chickens against coccidiosis. 

It has been discovered that this gene is expressed primarily in adrenal gland 
tissue, and to a lesser extent in activated T-cells. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) orceli type(s) present in abiological sample ancTfor 
diagnosis of the following diseases and conditions: adrenal cortical insufficiency, 
adrenal cortical hyperfunction, neoplasia. Similarly, polypeptides and antibodies 
directed to those polypeptides are useful to provide immunological probes for 
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differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the endocrine system, expression of this gene 
at significantly higher or lower levels may be detected in certain tissues or cell types 
(e.g., endocrine, cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, 
5 plasma, urine, synovial fluid or spinal fluid) taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue from an individual not having the disorder. 

The tissue distribution in adrenal gland tissue suggests that the protein product 
of this clone is useful for the diagnosis and/or intervention of disorders caused by 

10 adrenal gland abnormalities, such as adrenal cortical insufficiency, adrenal cortical 
hyperfunction, and neoplasia. More generally, the tissue distribution suggests that the 
protein product of this clone is useful for the detection, treatment, and/or prevention 
of various endocrine disorders and cancers, particularly Addison's disease, Cushing's 
Syndrome, and disorders and/or cancers of the pancrease (e.g. diabetes mellitus), 

15 adrenal cortex, ovaries, pituitary (e.g., Tiyper-, hypopituitarism), thyroid (e.g. hypei 1 -, 
hypothyroidism), parathyroid (e.g. hyper-, hypoparathyroidism) , hypothalamus, and 
testes. Protein, as well as, antibodies directed against the protein may show utility as a 
tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 

20 available and accessible through sequence databases. Some of these sequences are 

related to SEQ ID NO: 54 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 

25 are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 472 of SEQ ID NO:54, b 
is an integer of 15 to 486, where both a and b correspond to the positions of 
nucleotide residues shown~in~SEQ ID~NO:54rand whereb is~gTeater th^^requal to a 
+ 14. 

30 

FEATURES OF PROTEIN ENCODED BY GENE NO: 45 
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The translation product of this gene shares sequence homology with neural 
thread protein, tumor necrosis factor related gene product, human alpha-lC2 
adrenalin receptor, which is thought to be important for diagnosing the presence of 
Alzheimer's disease, neuroectodermal tumours and a malignant astrocytoma, or 
diagnosis of hepatocellular carcinomas and preneoplastic or pathological conditions 
of the liver, and tumor immunity. 

It has been discovered that this gene is expressed primarily in activated T-cells 
and endothelial cells. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: Alzheimer's disease, 
neuroectodermal tumours and a malignant astrocytoma, hepatocellular carcinomas 
and tumors of various origins. Similarly, polypeptides and antibodies directed to those 
polypeptides are useful to provide immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune system and endothelial cells, expression of this gene at 
significantly higher or lower levels may be detected in certain tissues or cell types 
(e.g., immune, endothelial, cancerous and wounded tissues) or bodily fluids (e.g., 
lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken from an individual 
having such a disorder, relative to the standard gene expression level, i.e., the 
expression level in healthy tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
152 as residues:-Arg-38 to Arg-47. 

The tissue distribution in immune and endothelial tissues, and the homology to 
neural thread protein, tumor necrosis factor related gene product, human alpha- 1C2 
adrenalin receptor, or Smaller hepatocellular oncoprotein (hhcm) gene product 
suggests that the protein product of this clone is useful for the diagnosis and/or 
treatmelirof tumors of Various originsTincluding neuroectodermal tumours and a 
malignant astrocytoma, hepatocellular carcinomas, as well as syndromes inflicted by 
these cancers. Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 
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Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:55 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1256 of SEQ ID NO:55, b 
is an integer of 15 to 1270, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:55, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 46 

It has been discovered that this gene is expressed primarily in tumor tissues 
such as hepatocellular tumor, hemangiopericytoma, chronic lymphocytic leukemia, 
and activated T-cells. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: tumors of various origins. 
Similarly, polypeptides and antibodies directed to those polypeptides are useful to 
provide immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
hepatocellular tumor, expression of this gene at significantly higher or lower levels 
may be detected in certain tissues or cell types (e.g., liver, immune, cancerous and 
wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or 
spinal fluid) taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue from an individual 
noTlfavinj^tlie^is^iBer 

The tissue distribution in hepatocellular tumors suggests that the protein 
product of this clone is useful for the diagnosis and/or targeting of hepatocellular 
carcinomas, preneoplastic or pathological conditions of the liver, Alzheimer's disease, 
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neuroectodermal tumours and malignant astrocytoma. Protein, as well as, antibodies 
directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
5 available and accessible through sequence databases. Some of these sequences are 

related to SEQ ID NO:56 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
10 are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 2045 of SEQ ID NO:56, b 
is an integer of 15 to 2059, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:56, and where b is greater than or equal to a 
+ 14. 

15 - 

FEATURES OF PROTEIN ENCODED BY GENE NO: 47 

It has been discovered that this gene is expressed primarily in glioblastoma, 
ulcerative colitis, and hemangiopericytoma. 

Therefore, nucleic acids of the invention are useful as reagents for differential 

20 identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: glioblastoma, 
hemangiopericytoma and their inflicted disorders. Similarly, polypeptides and 
antibodies directed to those polypeptides are useful to provide immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 

25 of the above tissues or cells, particularly of the brain tissues, expression of this gene 
at significantly higher or lower levels may be detected in certain tissues or cell types 
(e.g., neural, cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, 

pIaOTarurine7^ribvM^ individual having such a — 

disorder, relative to the standard gene expression level, i.e., the expression level in 
30 healthy tissue from an individual not having the disorder. 
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Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
154 as residues: Pro-31 to Ala-37. 

The tissue distribution suggests that the protein product of this clone would be 
useful for the diagnosis, targeting and/or treatment of tumors in the brain, such as 
glioblastoma and hemangiopericytoma. Additionally, the gene products can be useful 
agent for the diagnosis and treatment of ulcerative colitis. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:57 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 854 of SEQ ID NO:57, b 
is an integer of 15 to 868, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:57, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 48 

It has been discovered that this gene is expressed primarily in bone marrow. 
Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: immunodeficiency, tumor 
necrosis, infection, lymphomas, auto-immunities, cancer, inflammation, anemias 
(leukemia) and other hematopoeitic disorders. Similarly, polypeptides and antibodies 
clirected to those polj^eptideslire u^ 

differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the immune system, expression of this gene 
at significantly higher or lower levels may be detected in certain tissues or cell types 
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(e.g., immune, cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, 
plasma, urine, synovial fluid or spinal fluid) taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue from an individual not having the disorder. 
5 Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 

' " -155 as residues: Thr-47 to Val-53. 

The tissue distribution in bone marrow suggests that the protein product of this 
clone is useful for the diagnosis and/or treatment of immune disorders including: 
leukemias, lymphomas, auto-immunities, immunodeficiencies (e.g. AIDS), immuno- 

10 supressive conditions (transplantation) and hematopoeitic disorders. In addition this 
gene product may be applicable in conditions of general microbial infection, 
inflammation or cancer. Furthermore, the tissue distribution in bone marrow suggests 
that the protein product of this clone is useful for the treatment and diagnosis of 
hematopoietic related disorders such as anemia, pancytopenia, leukopenia, 

15 thrombocytopenia or leukemia. 

The uses include bone marrow celi ex vivo culture, bone marrow 
transplantation, bone marrow reconstitution, radiotherapy or chemotherapy of 
neoplasia. The gene product may also be involved in lymphopoiesis, therefore, it can 
be used in immune disorders such as infection, inflammation, allergy, 

20 immunodeficiency etc. In addition, this gene product may have commercial utility in 
the expansion of stem cells and committed progenitors of various blood lineages, and 
in the differentiation and/or proliferation of various cell types. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

25 Many polynucleotide sequences, such as EST sequences, are publicly 

available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:58 and may have been publicly available prior to conception of 
tKeTpresenrin vention: Pref^aBly7^ch"relafed"poly nucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 

30 would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
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general formula of a-b, where a is any integer between 1 to 972 of SEQ ID NO:58, b 
is an integer of 15 to 986, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:58, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 49 

It has been discovered that this gene is expressed primarily in bone marrow. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: immunodeficiency, tumor 
necrosis, infection, lymphomas, auto-immunities, cancer, inflammation, anemias 
(leukemia) and other hematopoeitic disorders. Similarly, polypeptides and antibodies 
directed to those polypeptides are useful to provide immunological probes for 
differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues orcells, particularly of the immune system, expression of this gene 
at significantly higher or lower levels may be detected in certain tissues or cell types 
(e.g., immune, cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, 
plasma, urine, synovial fluid or spinal fluid) taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
156 as residues: Leu-40 to Cys-47. 

: The bone marrow tissue distribution suggests that the protein product of this 

clone would be useful for the diagnosis and treatment of immune disorders including: 
leukemias, lymphomas, auto-immunities, immunodeficiencies (e.g. AIDS), immuno- 
supressive conditions (transplantation) and hematopoeitic disorders. In addition this 
gene product may be applicable in conditions of general microbial infection, 
inflammation or cancerrFurtheimore^ suggests 
that the protein product of this clone is useful for the treatment and diagnosis of 
hematopoietic related disorders such as anemia, pancytopenia, leukopenia, 
thrombocytopenia or leukemia. 
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The uses include bone marrow cell ex vivo culture, bone marrow 
transplantation, bone marrow reconstitution, radiotherapy or chemotherapy of 
neoplasia. The gene product may also be involved in lymphopoiesis, therefore, it can 
be used in immune disorders such as infection, inflammation, allergy, 
5 immunodeficiency etc. In addition, this gene product may have commercial utility in 
the expansion of stem cells and committed progenitors of various blood lineages, and 
in the differentiation and/or proliferation of various cell types. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

10 Many polynucleotide sequences, such as EST sequences, are publicly 

available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:59 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 

15 would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 681 of SEQ ID NO:59, b 
is an integer of 15 to 695, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:59, and where b is greater than or equal to a 

20 +14. 



FEATURES OF PROTEIN ENCODED BY GENE NO: 50 

Inspecific embodiments, polypeptides of the in vehtion comprise the following 
amino acid sequence: IAQGTVPLTKRGVQSSGPDYPEGTLTPLPRG (SEQ ID 
25 NO:266 and 267). Polynucleotides encoding these polypeptides are also encompassed 
by the invention. 

It has been discovered that this gene is expressed primarily in dendritic cells. 
Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
30 diagnosis of the following diseases and conditions: immune disorders and related 

conditions such as leukemias, lymphomas, inflammation, hematopoeitic disfunction, 
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arthritis and asthma. Similarly, polypeptides and antibodies directed to those 
polypeptides are useful to provide immunological probes for differential identification 
of dendritic cells. For a number of disorders of the above tissues or cells, particularly 
of the immune system, expression of this gene at significantly higher or lower levels 
5 may be detected in certain tissues or cell types (e.g., dendritic cells, cancerous and 
wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or 
spinal fluid) taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue from an individual 
not having the disorder. 

10 Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 

1 57 as residues: Ser-25 to Phe-3 1 , Lys-55 to Arg-6 1 . 

The tissue distribution in dendritic cells suggests that the protein product of 
this clone is useful for the diagnosis and/or treatment of immune disorders including: 
leukemias, lymphomas, auto-immunities, immunodeficiencies (e.g. AIDS), immuno- 

15 supressive conditions (transplantation) and hematopoeitic disorders. In addition this 
gene product may be applicable in conditions of general microbial infection, 
inflammation or cancer. 

Moreover, the expression of this gene product in dendritic cells also strongly 
suggests a role for this protein in immune function and immune surveillance. Protein, 

20 as well as, antibodies directed against the protein may show utility as a tumor marker 
and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:60 and may have been publicly available prior to conception of 

25 the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 

- jwo.ul dbe _g umbers ome . Accordingly , preferably excluded from the present invention 

are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 300 of SEQ ID NO:60, b 

30 is an integer of 15 to 314, where both a and b correspond to the positions of 
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nucleotide residues shown in SEQ ID NO:60, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 51 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: DCLYLALSFPWHCHCHHHPPSGSLLYPF (SEQ ID 
NO:268). Polynucleotides encoding these polypeptides are also encompassed by the 
invention. The translation product of this gene shares sequence homology with a C. 
elegans protein of unknown function (See Genbank Accession No.: gill 947 142 
(AF000264)). 

It has been discovered that this gene is expressed primarily in healing 
abdominal wound tissue. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases' and conditions: tissue necrosis, wound healing, 
ulceration, neoplasms or cancer. Similarly, polypeptides and antibodies directed to 
those polypeptides are useful to provide immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of injured tissue, expression of this gene at significantly 
higher or lower levels may be detected in certain tissues or cell types (e.g., vascular, 
endothelial, cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, 
plasma, urine, synovial fluid or spinal fluid) taken from an individual having such a 

disorder, relative to thestandard gene expression level, i.e., the expression level in 

healthy tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
158 as residues: Pro-34 to Tyr-43, Gln-73 to Cys-86, Pro-98 to Leu-103. 

The tissue distribution in healing abdominal wound tissue suggests that the 
_prc^hLpxoduct_ofthisxlone_is-Useful for-the4reatment-and/or-diagnosis-of conditions — 
involving tissue repair and wound healing. Tissue repair may be indicated in cases of 
injury to the skin or internal organs, ulceration, cellular necrosis or other conditions 
involving healing of both diseased or non-diseased, traumatized tissue. In addition, 
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because of the implications of tissue regeneration, remoldeling and growth regulation, 
the protein product of this gene may have indications in the diagnosis and treatment 
of neoplasms and cancer. 

More generally, the tissue distribution in endothelial tissue indicates that the 
5 . protein product of this gene is useful for the diagnosis and treatment of conditions and 
pathologies of the cardiovascular system, such as heart disease, restenosis, 
atherosclerosis, stoke, angina, thrombosis, and wound healing. Likewise, the tissue 
distribution further suggests that the protein product of this clone is useful for the 
treatment, diagnosis, and/or prevention of various skin disorders including congenital 

10 disorders (i.e. nevi, moles, freckles, Mongolian spots, hemangiomas, port-wine 
syndrome), integumentary tumors (i.e. keratoses, Bowen's disease, basal cell 
carcinoma, squamous cell carcinoma, malignant melanoma, Paget's disease, mycosis 
fungoides, and Kaposi's sarcoma), injuries and inflammation of the skin (i.e.wounds, 
rashes, prickly heat disorder, psoriasis, dermatitis), atherosclerosis, uticaria, eczema, 

15 photosensitivity, autoimmune disorders"(i.e. lupus erythematosus, vitiligo, 

dermatomyositis, morphea, scleroderma, pemphigoid, and pemphigus), keloids, striae, 
erythema, petechiae, purpura, and xanthelasma. In addition, such disorders may 
predispose increased susceptibility to viral and bacterial infections of the skin (i.e. 
cold sores, warts, chickenpox, molluscum contagiosum, herpes zoster, boils, cellulitis, 

20 erysipelas, impetigo, tinea, althletes foot, and ringworm). Moreover, the protein 
product of this clone may also be useful for the treatment or diagnosis of various 
connective tissue disorders such as arthritis, trauma, tendonitis, chrondomalacia and 
inflammation, autoimmune disorders such as rheumatoid arthritis, lupus, scleroderma, 
and dermatomyositis as well as dwarfism, spinal deformation, and specific joint 

25 abnormalities as well as chondrodysplasias (i.e. spondyloepiphyseal dysplasia 
congenita, familial osteoarthritis, Atelosteogenesis type II, metaphyseal 
chondrodysplasia type Schmid). Protein, as well as, antibodies directed against the 

protein may show utility-as a-tumor-marker and/or-immunotherapy targets for the- - — 

above listed tissues. 

30 Many polynucleotide sequences, such as EST sequences, are publicly 

available and accessible through sequence databases. Some of these sequences are 
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related to SEQ ID NO:61 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 720 of SEQ ID NO:61, b 
is an integer of 1 5 to 734, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:61, and where b is greater than or equal to a 
+ 14. 



FEATURES OF PROTEIN ENCODED BY GENE NO: 52 

The translation product of this gene shares sequence homology with FAR- 

17A, which is an androgen induced protein, absent in castrated hamsters (See 

Genbank Accession No.: gill91315), as well as a male hormone-dependent gene 
15 product (See GenSeq Accession No.: R10612). The gene encoding the disclosed 

cDNA is thought to reside on chromosome 6. Accordingly, polynucleotides related to 

this invention are useful as a marker in linkage analysis for chromosome 6. 

In specific embodiments, polypeptides of the invention comprise the following 

amino acid sequences: ASLPPSRSRPLANMA1.VPCQVLRMAILLSYCSILCNYKA 
20 ffiMPSHQTYGGSWKFLTFIDLVIQAVFFGICVLTDLSSLLTRGSGNQEQERQLK 

KLISLRDW (SEQ ID NO:269). Polynucleotides encoding these polypeptides are also 

encompassed by the invention. 

It has been-discovered that this gene is expressed primarily in fetal liver and 

spleen tissue, and to a lesser extent in a variety of other fetal tissues and brain tissues. 
25 Therefore, nucleic acids of the invention are useful as reagents for differential 

identification of the.tissue(s) or cell type(s) present in a biological sample and for 

diagnosis of the following diseases and conditions: immune disorders including 
leukemias,-ly mphomas ; reproducti ve and endocrine-disorders; including testicular 

cancer; and liver disorders (e.g. hepatoblastoma, metabolic diseases and conditions 
30 that are attributable to the differentiation of hepatocyte progenitor cells). Similarly, 

polypeptides and antibodies directed to those polypeptides are useful to provide 
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immunological probes for differential identification of the tissue(s) or cell type(s). For 
.a number of disorders of the above tissues or cells, particularly of the immune and 
reproductive systems, expression of this gene at significantly higher or lower levels 
may be detected in certain tissues or cell types (e.g., immune, reproductive, cancerous 
and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial 
fluid or spinal fluid) taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue from an 
individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
159 as residues; Thr-59 to Gly-70, Tyr-132 to Glu-150. 

The tissue distribution and homology to FAR-17A suggests that the protein 
product of this clone is useful for the treatment and/or diagnosis of androgen related 
conditions and disorders. Male reproductive and endocrine disorders would be 
potential area of application (e.g. endocrine function, sperm maturation). It may also 
prove to be valuable in the diagnosis and treatment of testicular cancer. 

More generally, the protein product of this clone may be useful for the 
treatment and/or diagnosis of conditions concerning proper testicular function (e.g. 
endocrine function, sperm maturation), as well as cancer. Therefore, this gene product 
is useful in the treatment of male infertility and/or impotence. This gene product is 
also useful in assays designed to identify binding agents, as such agents (antagonists) 
are useful as male contraceptive agents. Similarly, the protein is believed to be useful 
in the treatment and/or diagnosis of testicular cancer. The testes are also a site of 
active gene expression of transcripts that may be expressed, particularly at low levels, 
in other tissues of the body. Therefore, this gene product may be expressed in other 
specific tissues or organs where it may play related functional roles in other 
processes, such as hematopoiesis, inflammation, bone formation, and kidney function, 
to name a few possible target indications. Protein, as well as, antibodies directed 
against the proteinmayshow utility ^Oumof marker Md/or~ii^uno^tfieTapy~targets~ 
for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
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related to SEQ ID NO:62 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
5 are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1396 of SEQ ID NO:62, b 
is an integer of 15 to 1410, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO: 62, and where b is greater than or equal to a 
+ 14. 

10 

FEATURES OF PROTEIN ENCODED BY GENE NO: 53 

Contact of cells with supernatant expressing the product of this gene has been 
shown to increase the permeability of the plasma membrane of THP-1 to calcium. 
Thus it is likely that the product of this gene is involved_in a signal transduction 

15 pathway that is initiated when the product binds a receptor on the surface of the 

plasma membrane of monocytes, and to a lesser extent, in immune or hematopoietic 
cells and tissues. Thus, polynucleotides and polypeptides have uses which include, 
but are not limited to, activating monocytes. 

In specific embodiments, polypeptides of the invention comprise the following 

20 amino acid sequence: MSRSSRISGLSCPWLL (SEQ ID NO:270). Polynucleotides 
encoding these polypeptides are also encompassed by the invention. The gene 
encoding the disclosed cDNA is believed to reside on chromosome 1. Accordingly, 
polynucleotides related to this invention are useful as a marker in linkage analysis for 
chromosome 1. 

25 It has been discovered that this gene is expressed primarily in T-cells. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following ^isea^sluia conditio and hematopoietic" 

diseases and/or disorders. Similarly, polypeptides and antibodies directed to those 

30 polypeptides are useful to provide immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
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particularly of the immune and haemopoietic systems, expression of this gene at 
significantly higher or lower levels may be detected in certain tissues or cell types 
(e.g., immune, hematopoietic, and cancerous and wounded tissues) or bodily fluids 
(e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
160 as residues: Pro-42 to Cys-50, Leu-61 to Ala-66. 

The tissue distribution in T-cells, combined with the detected calcium flux 
activity in monocytes suggests that the protein product of this clone would be useful 
for the treatment and diagnosis of immune and hematopoietic disorders. Morever, the 
expression of this gene product suggests a role in regulating the proliferation; 
survival; differentiation; and/or activation of hematopoietic cell lineages, including 
blood stem cells. This gene product may be involved in the regulation of cytokine 
production, antigen presentation, or other processes suggesting a usefulness in the 
treatment of cancer (e.g. by boosting immune responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product may be involved in immune functions. Therefore it may be also used as an 
agent for immunological disorders including arthritis, asthma, immunodeficiency 
diseases such as AIDS, leukemia, rheumatoid arthritis, granulomatous disease, 
inflammatory bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, 
hypersensitivities, such as T-cell mediated cytotoxicity; immune reactions to 
transplanted organs and tissues, such as host-versus-graft and graft- versus-host 
diseases, or autoimmunity disorders, such as autoimmune infertility, lense tissue 
injury, demyelination, systemic lupus erythematosis, drug induced hemolytic anemia, 
rheumatoid arthritis, Sjogren's disease, scleroderma and tissues. 

Moreover, the protein may represent a secreted factor that influences the 
differentiation or behavior of other blood cellsTor tYatYe^iisTiematopoietic cells To 
sites of injury. In addition, this gene product may have commercial utility in the 
expansion of stem cells and committed progenitors of various blood lineages, and in 
the differentiation and/or proliferation of various cell types. Protein, as well as, 
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antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:63 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1217 of SEQ ED NO: 63, b 
is an integer of 15 to 1231, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:63, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 54 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: DHWPAGFLPPAPGLKFPVALEVFRKVLPAVCPTDCSGS 
AGKERNS (SEQ ID NO:271). Polynucleotides encoding these polypeptides are also 
encompassed by the invention. 

It has been discovered that this gene is expressed primarily in liver. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: metabolic diseases and liver 
conditions. Similarly, polypeptides and antibodies directed to those polypeptides are 
useful to provide immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the metabolic system, expression of this gene at significantly higher or lower levels 

may be detected in certain tissues or cell types (eTgT7 Ke^ticrliveTrrMfaboIic7 and 

cancerous and wounded tissues) or bodily fluids (e.g., lymph, bile, serum, plasma, 
urine, synovial fluid or spinal fluid) taken from an individual having such a disorder, 
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relative to the standard gene expression level, i.e., the expression level in healthy 
tissue from an individual not having the disorder. 

Prefen-ed epitopes include those comprising a sequence shown in SEQ ID NO. 
161 as residues: Ser-31 to Gln-41. 

The tissue distribution in liver suggests that the protein product of this clone 
would be useful for treatment and diagnosis of disorders of the metabolic system and 
liver disorders. Morever, the protein product of this clone is useful for the detection 
and treatment of liver disorders and cancers (e.g. hepatoblastoma, jaundice, hepatitis, 
liver metabolic diseases and conditions that are attributable to the differentiation of 
hepatocyte progenitor cells). Protein, as well as, antibodies directed against the 
protein may show utility as a tumor marker and/or immunotherapy targets for the 
above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:64 and -may "have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 598 of SEQ ID NO:64, b 
is an integer of 15 to 612, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO: 64, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 55 

When tested against PC 12 cell lines, supernatants removed from cells 
containing this gene activated the EGR1 (early growth response gene 1) promoter 
element. Thus, it is likely that this gene activates sensory neurorTcells, and to aTlesser _ 
extent in other neural cells and tissues, through the EGR1 signal transduction 
pathway. EGR1 is a separate signal transduction pathway from Jak-STAT, genes 
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containing the EGR1 promoter are induced in various tissues and cell types upon 
activation, leading the cells to undergo differentiation and proliferation. 

It has been discovered that this gene is expressed primarily in T-cells and 
monocytes, and to a lesser extent in cancerous tissues, including cancerous colon 

5 tissue and placenta. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: immune and haemopoietic 
disorders and cancer such as colon cancer, but also such cancers as breast cancer, 
10 cardiac tumors, pancreatic cancer, melanoma, retinoblastoma, glioblastoma, lung 

cancer, intestinal cancer, testicular cancer, stomach cancer, neuroblastoma, myxoma, 
myoma, lymphoma, endothelioma, osteoblastoma, osteoclastoma, adenoma, and the 
like. Similarly, polypeptides and antibodies directed to those polypeptides are useful 
to provide immunological probes for differential identification of the tissue(s) or cell 
15 type(s). For a number of disorders of the above tissues or cells, particularly of the 

immune and haemopoietic systems, expression of this gene at significantly higher or 
lower levels may be detected in certain tissues or cell types (e.g., immune, 
hematopoietic, and cancerous and wounded tissues) or bodily fluids (e.g., lymph, 
serum, plasma, urine, synovial fluid or spinal fluid) taken from an individual having 
20 such a disorder, relative to the standard gene expression level, i.e., the expression 
level in healthy tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
162 as residues: Glu-63 to Trp-72. 

The tissue distribution in T-cells and monocytes, combined with the detected 
25 EGR1 biological activity suggests that the protein product of this clone would be 
useful for treatment and diagnosis of disorders of the immune and haemopoietic 
systems and colon and other cancers. This gene product may be involved in the 

regulationof cytokine p^ processessuggesting^ 

a usefulness in the treatment of cancer (e.g. by boosting immune responses). 
30 Since the gene is expressed in cells of lymphoid origin, the natural gene 

product may be involved in immune functions. Therefore it may be also used as an 
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agent for immunological disorders including arthritis, asthma, immunodeficiency 
diseases such as AIDS, leukemia, rheumatoid arthritis, granulomatous disease, 
inflammatory bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, 
hypersensitivities, such as T-cell mediated cytotoxicity; immune reactions to 
5 transplanted organs and tissues, such as host-versus-graft and graft-versus-host 
diseases, or autoimmunity disorders, such as autoimmune infertility, lense tissue 
injury, demyelination, systemic lupus erythematosis, drug induced hemolytic anemia, 
rheumatoid arthritis, Sjogren's disease, scleroderma and tissues. 

Moreover, the protein may represent a secreted factor that influences the 

10 differentiation or behavior of other blood cells, or that recruits hematopoietic cells to 
sites of injury. In addition, this gene product may have commercial utility in the 
expansion of stem cells and committed progenitors of various blood lineages, and in 
the differentiation and/or proliferation of various cell types. Expression cellular 
sources marked by proliferating cells suggests this protein may play a role in the 

15 regulation of cellular division, and may show utility in the diagnosis and treatment of 
cancer and other proliferative disorders. Similarly, developmental tissues rely on 
decisions involving cell differentiation and/or apoptosis in pattern formation. 
Dysregulation of apoptosis can result in inappropriate suppression of cell death, as 
occurs in the development of some cancers, or in failure to control the extent of cell 

20 death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SMA). 

Therefore, the polynucleotides and polypeptides of the present invention are 
useful in treating, detecting, and/or preventing said disorders, and conditions, in 
addition to other types of degenerative conditions. Thus this protein may modulate 

25 apoptosis or tissue differentiation and would be useful in the detection, treatment, 

and/or prevention of degenerative or proliferative conditions and diseases. Protein, as 
well as, antibodies directed against the protein may show utility as a tumor marker 

and/or immunotherapy_targets_for the above-listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 

30 available and accessible through sequence databases. Some of these sequences are 

related to SEQ ID NO:65 and may have been publicly available prior to conception of 
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the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
5 general formula of a-b, where a is any integer between 1 to 2256 of SEQ ID NO:65, b 
is an integer of 15 to 2270, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:65, and where b is greater than or equal to a 
+ 14. 

10 FEATURES OF PROTEIN ENCODED BY GENE NO: 56 

The translation product of this gene has homology with several human keratin 
genes at the nucleotide level (see, for example, Troyanovsky, et al., Eur. J. Cell Biol. 
59: 127-137 (1992) which is hereby incorporated by reference herein). Based on the 
sequence similarity, the translation product of this clone is expected to share 
15 biological activities with keratiri-arid growth factor proteins. Such activities are known 
in the art, and some of which are described elsewhere herein. 

It has been discovered that this gene is expressed primarily in neutrophils. 
Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
20 diagnosis of the following diseases and conditions: immune and haemopoietic 

disorders. Similarly, polypeptides and antibodies directed to those polypeptides are 
useful to provide immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the immune and haemopoietic system, expression of this gene at significantly higher 
25 or lower levels may be detected in certain tissues or cell types (e.g., cancerous and 

wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or 
spinal fluid) taken from an individual having such a disorder, relative to the standard 

gene-expression level,-i.e.^ the expression-level-in healthy tissuefrom an individual 

not having the disorder. 
30 The tissue distribution in neutrophils suggests that the protein product of this 

clone would be useful for treatment and diagnosis of disorders of the immune and 
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haemopoietic system. Furthermore, sequence homology of the polynucleotides and 
polypeptides of the present invention with a number of human cytokeratin molecules, 
such as CK-8, CK-15, and CK-17, indicate that molecules of the present invention 
can be used diagnostically as markers of basal cell differentiation in complex epithelia 
5 and therefore indicative of a certain type of epithelial stem cells, as well as markers of 
the differentiation of other cell types such as neutrophils or other immune cells. 
Molecules of the present invention, or agonists or antagonists thereof, can also be 
used therapeutically to treat differentiation disorders of epithelial, neutrophil or other 
immune cell differentiation or activation. Protein, as well as, antibodies directed 

1 0 against the protein may show utility as a tumor marker and/or immunotherapy targets 
for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:66 and may have been publicly available prior to conception of 

1 5 the present invention. Preferably, such related polynucleotides "are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1269 of SEQ ID NO:66, b 

20 is an integer of 15 to 1283, where both a and b correspond to the positions of 

nucleotide residues shown in SEQ ID NO:66, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 57 
25 In specific embodiments, polypeptides of the invention comprise the following 

amino acid sequence: EEIATSffiPIRDFLAIVFFASIGLHVFPTFVAYELTVLVF 
LTLSVVV (SEQ ID NO:272). Polynucleotides encoding these polypeptides are also 

encompassed_by_the invention 

It has been discovered that this gene is expressed primarily in synovium, 
30 placenta, and stromal cells, and to a lesser extent in several other tissues and organs, 
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including, among others, bone marrow, palate, pituitary gland, and in tissue derived 
from osteosarcoma and chondrosarcoma. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
5 diagnosis of the following diseases and conditions: developmental disorders, as well\ 
as disorders of the musculoskeletal and haematopoietic systems, and cancers 
including especially osteosarcoma and chondrosarcoma, but also other cancers 
including breast cancer, colon cancer, cardiac tumors, pancreatic cancer, melanoma, 
retinoblastoma, glioblastoma, lung cancer, intestinal cancer, testicular cancer, 

10 stomach cancer, neuroblastoma, myxoma, myoma, lymphoma, endothelioma, 

osteoblastoma, osteoclastoma, adenoma, and the like. Similarly, polypeptides and 
antibodies directed to those polypeptides are useful to provide immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the haemopoietic and musculoskeletal 

15 systems, as well as developmental disorders, expression of this gene at significantly 
higher or lower levels may be detected in certain tissues or cell types (e.g., synovium, 
placenta, stromal, immune, hematopoietic, skeletal, and cancerous and wounded 
tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal 
fluid) taken from an individual having such a disorder, relative to the standard gene 

20 expression level, i.e., the expression level in healthy tissue from an individual not 
having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
164-as residues: Pro-81 to Ser-88. ■ - - 

The tissue distribution in placenta suggests that the protein product of this 

25 clone would be useful for treatment and diagnosis of developmental disorders. 

Polynucleotides and polypeptides of the present invention can be used diagnostically 
and therapeutically to detect and treat many cancers, particularly osteosarcoma and 

chondrosarcoma-^ addition, the expressionof this gene-product in synoviunrwould ~ 

suggest a role in the detection and treatment of disorders and conditions affecting the 

30 skeletal system, in particular osteoporosis, bone cancer, as well as, disorders afflicting 
connective tissues (e.g. arthritis, trauma, tendonitis, chrondomalacia and 
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inflammation), such as in the diagnosis or treatment of various autoimmune disorders 
such as rheumatoid arthritis, lupus, scleroderma, and dermatomyositis as well as 
dwarfism, spinal deformation, and specific joint abnormalities as well as 
chondrodysplasias (i.e. spondyloepiphyseal dysplasia congenita, familial 
5 osteoarthritis, Atelosteogenesis type II, metaphyseal chondrodysplasia type Schmid). 

Moreover, the protein is useful in the detection, treatment, and/or prevention 
of a variety of vascular disorders and condtions, which include, but are not limited to 
miscrovascular disease, vascular leak syndrome, aneurysm, stroke, embolism, 
thrombosis, coronary artery disease, arteriosclerosis, and/or atherosclerosis. Protein, 

10 as well as, antibodies directed against the protein may show utility as a tumor marker 
and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:67 and may have been publicly available prior to conception of 

15 the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1249 of SEQ ID NO:67, b 

20 is an integer of 15 to 1263, where both a and b correspond to the positions of 

nucleotide residues shown in SEQ ID NO: 67, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 58 
25 Contact of cells with supernatant expressing the product of this gene has been 

shown to increase the permeability of the plasma membrane of renal messiaglia cells 

to calcium. Thus it is likely that the product of this gene is involved in a signal 
transduction pathway that is initiatedwhenthe product binds a receptor on the surface" 

of the plasma membrane of renal and developing cells and tissuesThus, 
30 polynucleotides and polypeptides have uses which include, but are not limited to, 

activating renal and developing cells and tissues. 
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In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: YCNLQCR (SEQ ID NO:273). Polynucleotides encoding these 
polypeptides are also encompassed by the invention. 

It has been discovered that this gene is expressed primarily in the whole 
5 developing embryo, as well as in ovarian cancer and placenta. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: developmental or reproductive 
diseases and/or disorders, in addition to the following and ovarian cancer, as well as 

10 other cancers including breast cancer, colon cancer, cardiac tumors, pancreatic cancer, 
melanoma, retinoblastoma, glioblastoma, lung cancer, intestinal cancer, testicular 
cancer, stomach cancer, neuroblastoma, myxoma, myoma, lymphoma, endothelioma, 
osteoblastoma, osteoclastoma, osteosarcoma, chondrosarcoma, adenoma, and the like. 
Similarly, polypeptides and antibodies directed to those polypeptides are useful to 

15 provide immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
developing and fetal system, expression of this gene at significantly higher or lower 
levels may be detected in certain tissues or cell types (e.g., developmental, 
reproductive, and cancerous and wounded tissues) or bodily fluids (e.g., lymph, 

20 amniotic fluid, serum, plasma, urine, synovial fluid or spinal fluid) taken from an 

individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue from an individual not having the disorder. 

The tissue distribution in embryonic and ovarian tissue, combined with the 
detected calcium flux activity, suggests that the protein product of this clone would be 

25 useful for tretment and diagnosis of developmental disorders as well as ovarian and 
other cancers. Expression within embryonic tissue and other cellular sources marked 
by proliferating cells suggests this protein may play a role in the regulation of cellular 
divisiolirandmay^how utilityin th^iaghosis arid treatment "of cancer and other 
proliferative disorders. Similarly, developmental tissues rely on decisions involving 

30 cell differentiation and/or apoptosis in pattern formation. Dysregulation of apoptosis 
can result in inappropriate suppression of cell death, as occurs in the development of 
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some cancers, or in failure to control the extent of cell death, as is believed to occur in 
acquired immunodeficiency and certain neurodegenerative disorders, such as spinal 
muscular atrophy (SMA). 

Therefore, the polynucleotides and polypeptides of the present invention are 
5 useful in treating, detecting, and/or preventing said disorders and conditions, in 
addition to other types of degenerative conditions. Thus this protein may modulate 
apoptosis or tissue differentiation and would be useful in the detection, treatment, 
and/or prevention of degenerative or proliferative conditions and diseases. 
Alternatively, the protein is useful in the detection, treatment, and/or prevention of 

10 vascular conditions, which include, but are not limited to, microvascular disease, 
vascular leak syndrome, aneurysm, stroke, atherosclerosis, arteriosclerosis, or 
embolism. Protein, as well as, antibodies directed against the protein may show utility 
as a tumor marker and/or immunotherapy targets for the above listed tissues. 
Many polynucleotide sequences, such as EST sequences, are publicly 

15 available and accessible through sequence databases. Some of these sequences are 

related to SEQ ID NO: 68 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 

20 are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1603 of SEQ ID NO:68, b 
is an integer of 15 to 1617, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:68; and where b is greater than or equal to a 
+ 14. 

25 

FEATURES OF PROTEIN ENCODED BY GENE NO: 59 

In specific embodiments, polypeptides of the invention comprise the following 
amino acidsequenceT S ALIGNPKGCFGCF^ 

LKTNFR (SEQ ID NO:274). Polynucleotides encoding these polypeptides are also 
encompassed by the invention. 
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It has been discovered that this gene is expressed primarily in hypothalamus 
and anergic T cells. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
5 diagnosis of the following diseases and conditions: neurological and inflammatory 
defects, diseases, and/or disorders. Similarly, polypeptides and antibodies directed to 
those polypeptides are useful to provide immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells 5 particularly of the central nervous and immune systems, expression of 

10 this gene at significantly higher or lower levels may be detected in certain tissues 

(e.g., neural, immune, hematopoietic, and cancerous and wounded tissues) or bodily 
fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue from an individual not having the disorder. 

15 Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 

166 as residues: His-33 to Trp-38. 

The tissue distribution in hypothalamus and T-cells suggests that the protein 
product of this clone would be useful for study and treatment of immune and nervous 
system disorders. The protein product of this clone is useful for the detection, 

20 treatment, and/or prevention of neurodegenerative disease states, behavioral 
disorders, or inflammatory conditions which include, but are not limited to 
Alzheimer's Disease, Parkinson's Disease, Huntington's Disease, Tourette Syndrome, 

7 meningitis, encephalitis, demyelinating diseases, peripheral neuropathies, neoplasia, - 
trauma, congenital malformations, spinal cord injuries, ischemia and infarction, 

25 aneurysms, hemorrhages, schizophrenia, mania, dementia, paranoia, obsessive 
compulsive disorder, depression, panic disorder, learning disabilities, ALS, 
psychoses, autism, and altered behaviors, including disorders in feeding, sleep 
patterns, balance, and perception. In addition, elevated 

Expression of this gene product in regions of the brain suggests it plays a role 

30 in normal neural function. Potentially, this gene product is involved in synapse 
formation, neurotransmission, learning, cognition, homeostasis, or neuronal 
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differentiation or survival. Morever, the expression of this gene product suggests a 
role in regulating the proliferation; survival; differentiation; and/or activation of 
hematopoietic cell lineages, including blood stem cells. This gene product may be 
involved in the regulation of cytokine production, antigen presentation, or other 
5 processes suggesting a usefulness in the treatment of cancer- (e.g. by boosting immune 
responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product may be involved in immune functions. Therefore it may be also used as an 
agent for immunological disorders including arthritis, asthma, immunodeficiency 

10 diseases such as AIDS, leukemia, rheumatoid arthritis, granulomatous disease, 
inflammatory bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, 
hypersensitivities, such as T-cell mediated cytotoxicity; immune reactions to 
transplanted organs and tissues, such as host-versus-graft and graft-versus-host 
diseases, or autoimmunity disorders, such as autoimmune infertility, lense tissue 

15 injury, demyelination, systemic lupus erythematosis, drug induced hemolytic anemia, . 
rheumatoid arthritis, Sjogren's disease, scleroderma and tissues. Moreover, the protein 
may represent a secreted factor that influences the differentiation or behavior of other 
blood cells, or that recruits hematopoietic cells to sites of injury. In addition, this gene 
product may have commercial utility in the expansion of stem cells and committed 

20 progenitors of various blood lineages, and in the differentiation and/or proliferation of 
various cell types. Protein, as well as, antibodies directed against the protein may 
show utility as a tumor marker and/or immunotherapy targets for the above listed 
tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
25 available and accessible through sequence databases. Some of these sequences are 

related to SEQ ID NO:69 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded fr6nvthe~scope of the present invSuibn7T6"list^^^ 
would be cumbersome. Accordingly, preferably excluded from the present invention 
30 are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1375 of SEQ ID NO:69 t b 
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is an integer of 15 to 1389, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:69, and where b is greater than or equal to a 
+ 14. 

5 FEATURES OF PROTEIN ENCODED BY GENE NO: 60 

The translation product of this gene shares nucleotide sequence homology 
with the human PKD1 gene which is thought to be important in polycystic kidney 
disease. 

This gene is expressed widely with a predominant expression exhibited in 

10 liver, pediatric kidney, and in the whole 8 week old developing human embryo. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: cancer, growth, renal, and 
metabolic defects, diseases, and/or disorders. Similarly, polypeptides and antibodies 

15 directed to those"polypeptides are useful to provide immunological probes for 

differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the endocrine, digestive and immune 
systems, expression of this gene at significantly higher or lower levels may be 
detected in certain tissues or cell types (e.g., renal, metabolic, hepatic, developmental, 

20 and cancerous and wounded tissues) or bodily fluids (e.g., lymph, amniotic fluid, bile, 
serum, plasma, urine, synovial fluid or spinal fluid) taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression 
level in healthy tissue from an individual not having the disorder. 

The tissue distribution in pediatric kidney suggests that the protein product of 

25 this clone would be useful for study and treatment of renal and general neoplasias and 
growth and development disorders. The protein product of this clone could be used in 
the treatment and/or detection of kidney diseases including renal failure, nephritus, 
; renal tubular acidosis," pro^ 

nephrotic syndrome, crush syndrome, glomerulonephritis, hematuria, renal colic and 

30 kidney stones, in addition to Wilm's Tumor Disease, and congenital kidney 

abnormalities such as horseshoe kidney, polycystic kidney, and Falconi's syndrome. 
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Moreover, the expression within embryonic tissue suggests this protein may 
play a role in the regulation of cellular division, and may show utility in the diagnosis 
and treatment of cancer and other proliferative disorders, particularly of the liver and 
other organs. Similarly, developmental tissues rely on decisions involving cell 
5 differentiation and/or apoptosis in pattern formation. Dysregulation of apoptosis can 
result in inappropriate suppression of cell death, as occurs in the development of some 
cancers, or in failure to control the extent of cell death, as is believed to occur in 
acquired immunodeficiency and certain neurodegenerative disorders, such as spinal 
muscular atrophy (SMA). Therefore, the polynucleotides and polypeptides of the 

10 present invention are useful in treating, detecting, and/or preventing said disorders 

and conditions, in addition to other types of degenerative conditions. Thus this protein 
may modulate apoptosis or tissue differentiation and would be useful in the detection, 
treatment, and/or prevention of degenerative or proliferative conditions and diseases. 
Protein, as well as, antibodies directed against the protein may show utility as a tumor 

15 marker and/or immunotherapy targets" for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:70 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 

20 excluded from the scope of the present invention. To list every related sequence 

would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1882 of SEQ ID NO:70, b 
is an integer of 15 to 1896, where both a and b correspond to the positions of 

25 nucleotide residues shown in SEQ ID NO:70, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN-ENGODED BY GENE- NO r 61— 



In specific embodiments, polypeptides of the invention comprise the following 
30 amino acid sequence: HEAALRGP (SEQ ID NO:275). Polynucleotides encoding 
these polypeptides are also encompassed by the invention. 
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It has been discovered that this gene is expressed primarily in human striatum 
depression. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
5 diagnosis of the following diseases and conditions: stroke, in addition to other, 

neurologically-related diseases and/or defects. Similarly, polypeptides and antibodies 
directed to those polypeptides are useful to provide immunological probes for 
differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the central nervous system, expression of 
10 this gene at significantly higher or lower levels may be detected in certain tissues 
(e.g., neural, musculoskeletal, and cancerous and wounded tissues) or bodily fluids 
(e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue from an individual not having the disorder. 
15 Preferred epitopes include; those comprising a sequence shown in SEQ ID NO. 

1 68 as residues : Glu-50 to Glu-6 1 . 

The tissue distribution in human striatum depression suggests that the protein 
product of this clone would be useful for study and treatment of central nervous 
system orders, such as seizures and other neurological conditions. The protein product 
20 of this clone is useful for the detection, treatment, and/or prevention of 

neurodegenerative disease states, behavioral disorders, or inflammatory conditions 
which include, but are not limited to Alzheimer's Disease, Parkinson's Disease, 
Huntington's Disease, Tourette Syndrome, meningitis, encephalitis, demyelinating 
diseases, peripheral neuropathies, neoplasia, trauma, congenital malformations, spinal 
cord injuries, ischemia and infarction, aneurysms, hemorrhages, schizophrenia, 
mania, dementia, paranoia, obsessive compulsive disorder, depression, panic disorder, 
learning disabilities, ALS, psychoses, autism, and altered behaviors, including 

disorders-in- feeding, sleep patterns, balance, and-perception. -In addition; elevated 

Expression of this gene product in regions of the brain suggests it plays a role 
in normal neural function. Potentially, this gene product is involved in synapse 
formation, neurotransmission, learning, cognition, homeostasis, or neuronal 
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differentiation or survival. Protein, as well as, antibodies directed against the protein 
may show utility as a tumor marker and/or immunotherapy targets for the above listed 
tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:71 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 294 of SEQ ID NO:71, b 
is an integer of 15 to 308, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:71, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 62 

This clone has homology to a cystine rich granulin peptide(s) from 
leucocyte(s) which has been termed Granulin E. Granulins inhibit keratinocytes and is 
useful topically for wound healing. The gene encoding the disclosed cDNA is 
believed to reside on chromosome 3. Accordingly, polynucleotides related to this 
invention are useful as a marker in linkage analysis for chromosome 3. 

It has been discovered that this gene is expressed primarily in infant brain. 
Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: neurological, developmental, and 
growth defects. Similarly, polypeptides and antibodies directed to those polypeptides 
are useful to provide immunological probes for differential identification of the 

tissue(s) or cell-type(s).-For-a-number of-disorders oftheabove tissuesor cells7 ~ 

particularly of the fetus and the nervous system, expression of this gene at 
significantly higher or lower levels may be detected in certain tissues (e.g., neural, 
developmental, growth, and cancerous and wounded tissues) or bodily fluids (e.g., 
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lymph, amniotic fluid, serum, plasma, urine, synovial fluid or spinal fluid) taken from 
an individual having such a disorder, relative to the standard gene expression level, 
i.e., the expression level in healthy tissue from an individual not having the disorder. 
Based on the strong conservation of cysteine residues, the polypeptide of the present 
5 invention can be used to inhibit keratinocytes and promote wound healing. 

The tissue distribution in infant brain suggests that the protein product of this 
clone would be useful for study and treatment of nervous system, neurodegenerative 
and developmental disorders. The protein product of this clone is useful for the 
detection, treatment, and/or prevention of neurodegenerative disease states, 
10 behavioral disorders, or inflammatory conditions which include, but are not limited to 
Alzheimer's Disease, Parkinson's Disease, Huntington's Disease, Tourette Syndrome, 
meningitis, encephalitis, demyelinating diseases, peripheral neuropathies, neoplasia, 
trauma, congenital malformations, spinal cord injuries, ischemia and infarction, 
aneurysms, hemorrhages, schizophrenia, mania, dementia, paranoia, obsessive 
15 compulsive disorder, depression, panic disorder, learning disabilities, ALS, 

psychoses, autism, and altered behaviors, including disorders in feeding, sleep 
patterns, balance, and perception. In addition, elevated 

Expression of this gene product in regions of the brain suggests it plays a role 
in normal neural function. Potentially, this gene product is involved in synapse 
20 formation, neurotransmission, learning, cognition, homeostasis, or neuronal 

differentiation or survival. Protein, as well as, antibodies directed against the protein 
may show utility as a tumor marker and/or immunotherapy targets for the above listed 
tissues. The homology to granulin proteins suggest the protein product of this clone is 
useful for the treatment, diagnosis, and/or prevention of various skin disorders 
25 including congenital disorders (i.e. nevi, moles, freckles, Mongolian spots, 

hemangiomas, port-wine syndrome), integumentary tumors (i.e. keratoses, Bowen's 
disease, basal cell carcinoma, squamous cell carcinoma, malignant melanoma, Paget' s 

disease,_mycosis.fungoides, and Kaposi's sareoma) r injuries-and inflammation of the 

skin (i.e.wounds, rashes, prickly heat disorder, psoriasis, dermatitis), atherosclerosis, 
30 uticaria, eczema, photosensitivity, autoimmune disorders (i.e. lupus erythematosus, 
vitiligo, dermatomyositis, morphea, scleroderma, pemphigoid, and pemphigus), 
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keloids, striae, erythema, petechiae, purpura, and xanthelasma. In addition, such 
disorders may predispose increased susceptibility to viral and bacterial infections of 
the skin (i.e. cold sores, warts, chickenpox, molluscum contagiosum, heipes zoster, 
boils, cellulitis, erysipelas, impetigo, tinea, althletes foot, and ringworm). Moreover, 
5 the protein product of this clone may also be useful for the treatment or diagnosis of 
various connective tissue disorders such as arthritis, trauma, tendonitis, 
chrondomalacia and inflammation, autoimmune disorders such as rheumatoid 
arthritis, lupus, scleroderma, and dermatomyositis as well as dwarfism, spinal 
deformation, and specific joint abnormalities as well as chondrodysplasias (i.e. 

10 spondyloepiphyseal dysplasia congenita, familial osteoarthritis, Atelosteogenesis type 
II, metaphyseal chondrodysplasia type Schmid). Protein, as well as, antibodies 
directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 

15 available and accessible through sequence databases. Some of these sequences are 

related to SEQ ID NO:72 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 

20 are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1674 of SEQ ID NO:72, b 
is an integer of 15 to 1688, where both a and b correspond to the positions of 

nucleotide residues showninSEQ IDNO:72, andwhere b is"greater thanor equalto a 

+ 14. 

25 

FEATURES OF PROTEIN ENCODED BY GENE NO: 63 

In specific embodiments, polypeptides of the invention comprise the following 

amino"acid"sequen^ 

Polynucleotides encoding these polypeptides are also encompassed by the invention. 
30 It has been discovered that this gene is expressed primarily in prostate cancer 

and dendritic cells. 
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Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: reproductive, immune, and 
hematopoietic diseases, defects and/or disorders. Similarly, polypeptides and 
5 antibodies directed to those polypeptides are useful to provide immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the endocrine and immune systems, 
• expression of this gene at significantly higher or lower levels may be detected in 
certain tissues or cell types (e.g., reproductive, immune, hematopoietic, and cancerous 
10 and wounded tissues) or bodily fluids (e.g., lymph, seminal fluid, serum, plasma, 

urine, synovial fluid or spinal fluid) taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy 
tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
15 170 as residues: Tfp-47 to Thr-54. 

The tissue distribution in prostate cells and tissues indicates that the protein 
products of this clone are useful for study, diagnosis and treatment of neoplasias, esp. 
of the prostate, and hormonal and metabolic disorders. Moreover, the protein product 
of this clone is useful for the treatment and diagnosis of hematopoietic related 
20 disorders such as anemia, pancytopenia, leukopenia, thrombocytopenia or leukemia 
since stromal cells are important in the production of cells of hematopoietic lineages. 
The uses include bone marrow cell ex- vivo culture, bone marrow transplantation, 
bone marrow reconstitution, radiotherapy or chemotherapy of neoplasia. The gene 
product may also be involved in lymphopoiesis, therefore, it can be used in immune 
25 disorders such as infection, inflammation, allergy, immunodeficiency etc. In addition, 
this gene product may have commercial utility in the expansion of stem cells and 
committed progenitors of various blood lineages, and in the differentiation and/or 
pr61ifeTation~of various cell"t>^s7Prdtein7aTwelI as^ antibodies ^directed againsftfie" ~ 
protein may show utility as a tumor marker and/or immunotherapy targets for the 
30 above listed tissues. 
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Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:73 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
5 excluded from the scope of the present invention. To list every related sequence 

would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1 124 of SEQ ID NO:73, b 
is an integer of .15 to 1 138, where both a and b correspond to the positions of 
10 nucleotide residues shown in SEQ ID NO:73, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 64 

In specific embodiments, polypeptides of the invention comprise the following 
15 amino acid sequence: NWAVLNMLLSKGKITIFLGPLECGS (SEQ ID NO:277). 
Polynucleotides encoding these polypeptides are also encompassed by the invention. 

It has been discovered that this gene is expressed primarily in B cell 
lymphoma. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
20 identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: immune and hematopoietic 
diseases, disorders, and/or defects, particularly cancers. Similarly, polypeptides and 

antibodies-directed-to those-polypeptidesareusefulto provideimmunologicalprobes- 

for differential identification of the tissue(s) or cell type(s). For a number of disorders 
25 of the above tissues or cells, particularly of the hemopoietic and immune systems, 
expression of this gene at significantly higher or lower levels may be detected in 
certain tissues or cell types (e.g., immune, hematopoietic, and cancerous and wounded 
tissues) or bodily fluids~(e:g~lymph^ 

fluid) taken from an individual having such a disorder, relative to the standard gene 
30 expression level, i.e., the expression level in healthy tissue from an individual not. 
having the disorder. 
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The tissue distribution in B cell lymphoma suggests that the protein product of 
this clone would be useful for study and treatment of blood and immune disorders and 
neoplasias, esp. of the lymphatic system. The protein product of this clone is useful 
for the treatment and diagnosis of hematopoietic related disorders such as anemia, 
5 pancytopenia, leukopenia, thrombocytopenia or leukemia since stromal cells are 

important in the production of cells of hematopoietic lineages. The uses include bone 
marrow cell ex- vivo culture, bone marrow transplantation, bone marrow 
reconstitution, radiotherapy or chemotherapy of neoplasia. The gene product may also 
be involved in .lymphopoiesis, therefore, it can be used in immune disorders such as 

10 infection, inflammation, allergy, immunodeficiency etc. In addition, this gene product 
may have commercial utility in the expansion of stem cells and committed . 
progenitors of various blood lineages, and in the differentiation and/or proliferation of 
various cell types. Protein, as well as, antibodies directed against the protein may 
show utility as a tumor marker and/or immunotherapy targets for the above listed 

15 tissues. " - 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:74 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 

20 excluded from the scope of the present invention. To list every related sequence 

would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between I to 763 of SEQ ID NO:74, b 
is an integer of 15 to 777, where both a and b correspond to the positions of 

25 nucleotide residues shown in SEQ ID NO:74, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENENOT65 

It has been discovered that this gene is expressed primarily in B cell 
30 lymphoma. 
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Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: immune and hematopoietic 
diseases, disorders, and/or defects, particularly cancer. Similarly, polypeptides and 
antibodies directed to those polypeptides are useful to provide immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the hemopoietic and immune systems, 
expression of this gene at significantly higher or lower levels may be detected in 
certain tissues or cell types (e.g., immune, hematopoietic, and cancerous and wounded 
tissues) or bodily fluids (e.g:, lymph, serum, plasma, urine, synovial fluid or spinal 
fluid) taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue from an individual not 
having the disorder. 

The tissue distribution in B cell lymphoma suggests that the protein product of 
this clone would be'useful for study arid treatment of neplasias, esp. of lymphatic 
organs, and immune disorders. The protein product of this clone is useful for the 
treatment and diagnosis of hematopoietic related disorders such as anemia, 
pancytopenia, leukopenia, thrombocytopenia or leukemia since stromal cells are 
important in the production of cells of hematopoietic lineages. The uses include bone 
marrow cell ex- vivo culture, bone marrow transplantation, bone marrow 
reconstitution, radiotherapy or chemotherapy of neoplasia. The gene product may also 
be involved in lymphopoiesis, therefore, it can be used in immune disorders such as 
infection, inflammation, allergy, immunodeficiency etc. In addition, this gene product 
may have commercial utility in the expansion of stem cells and committed 
progenitors of various blood lineages, and in the differentiation and/or proliferation of 
various cell types. Protein, as well as, antibodies directed against the protein may 
show utility as a tumor marker and/or immunotherapy targets for the above listed 
"tissues; ~ 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:75 and may have been publicly available prior to conception of 
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the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
5 general formula of a-b, where a is any integer between 1 to 1046 of SEQ ID NO:75, b 
is an integer of 15 to 1060, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:75, and where b is greater than or equal to a 
+ 14. 

10 FEATURES OF PROTEIN ENCODED BY GENE NO: 66 

The translation product of this gene shares sequence homology with a rat 
protein phosphatase, in addition to, a human heterogeneous nuclear ribonucleoprotein 
R (See Genbank Accession No.gil2697103 (AF000364)). When tested against PC 12 
cell lines, supernatants removed from cells containing this gene activated the EGR1 

15 (early growth response; gene 1). promoter element. Thus, it is likely that this gene 

activates sensory neuron cells through the EGR1 signal transduction pathway. EGR1 
is a separate signal transduction pathway from Jak-STAT, genes containing the EGR1 
promoter are induced in various tissues and cell types upon activation, leading the 
cells to undergo differentiation and proliferation. This gene also showed activity in 

20 sensory neurons using the EGR assay described in the Example section. 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: PSHQTRKGKSAKLLDRPPEALRMKnTTTLLLACHLQLEV 
GVVVGGEVD ( S E Q I D N 0:278), 

FQASSANNQQNWGSQPIAQQPLQQGGDYSG 

25 NYGYNNDNQEFYQDTYGQQWK (SEQ ID NO:279), WXPLLXTSGSPGLXGFG 
TRMNGKEIEGEEIEIVLAKPPDKKRKERQAARQASRSTAYEDYYYHPPPRMPP 



PIRGRGRGGGRGGYGYPPDYYGYEDYYDDYYGYDYHDYRGGYEDPYYGYD 




RGGPAQQQRGRGSRGSRGNRGGNVGGKRKADGYNQPDSKRRQPTTNRTGV 
30 PNPSLSSRFSKVVTILVTMVTIMTTRNFIRILMGNSGSRQVRA (SEQ ID 



NO:280), RMNGKEIEGEEIEIVLAKPPDKKRKER (SEQ ID NO:281), YYHPPP 
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RMPP PIRGRGRGGGRGGYG (SEQ ID NO:282), DYRGGYEDPYYGYDDGYAV 
RGRGGGR (SEQ ID NO:283), PPPRGRAGYSQRGAPLGPPRGSRGGRGG (SEQ 
ID NO:284), and/or ADGYNQPDSK RRQPTTNRTGVPNPSLSSRFSKVVT (SEQ 
ID NO:285). Polynucleotides encoding these polypeptides are also encompassed by 
5 the invention. The gene encoding the disclosed cDNA is believed to reside on 
chromosome 1. Accordingly, polynucleotides related to this invention are useful as 
a marker in linkage analysis for chromosome 1 . 

It has been discovered that this gene is expressed primarily in human primary 
breast cancer, lung, and leukocytes. 

10 Therefore, nucleic acids of the invention are useful as reagents for differential 

identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: reproductive, immune, or 
pulmonary diseases and/or disorders, particularly breast cancer. Similarly, 
polypeptides and antibodies directed to those polypeptides are useful to provide 

15 immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the reproductive, 
immune and respiratory systems, expression of this gene at significantly higher or 
lower levels may be detected in certain tissues or cell types (e.g., reproductive, 
immune, pulmonary, and cancerous and wounded tissues) or bodily fluids (e.g., 

20 lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken from an individual 
having such a disorder, relative to the standard gene expression level, i.e., the 
expression level in healthy tissue from an individual not having the disorder. 

The tissue-distribution inbreast cancer cells and tissues, in addition to immune 

cells, combined with the homology to a protein phosphatase suggests that the protein 

25 product of this clone would be useful for diagnosis and treatment of breast cancer and 
abnormalities of the lung and the immune system. Morever, the expression of this 
gene product suggests a role in regulating the proliferation; survival; differentiation; 

and/or-activation*of-hematopoietie eelHineagesrincludingblood stenrcells.^Fhis gene 

product may be involved in the regulation of cytokine production, antigen 

30 presentation, or other processes suggesting a usefulness in the treatment of cancer 
(e.g. by boosting immune responses). 
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Since the gene is expressed in cells of lymphoid origin, the natural gene 
product may be involved in immune functions. Therefore it may be also used as an 
agent for immunological disorders including arthritis, asthma, immunodeficiency 
diseases such as AIDS, leukemia, rheumatoid arthritis, granulomatous disease, 
5 inflammatory bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, 
hypersensitivities, such as T-cell mediated cytotoxicity; immune reactions to 
transplanted organs and tissues, such as host-versus-graft and graft-versus-host 
diseases, or autoimmunity disorders, such as autoimmune infertility, lense tissue 
injury, demyelination, systemic lupus erythematosis, drug induced hemolytic anemia, 

10 rheumatoid arthritis, Sjogren's disease, scleroderma and tissues. 

Moreover, the protein may represent a secreted factor that influences the 
differentiation or behavior of other blood cells, or that recruits hematopoietic cells to 
sites of injury. In addition, this gene product may have commercial utility in the 
expansion of stem cells and committed progenitors of various blood lineages, and in 

15 the differentiation and/or proliferation of various cell types. The protein is useful in 
modulating the immune response to aberrant cells and cell types, particularly 
proliferative cells (e.g. protein may increase the immunogenicity of tumor antigens 
either directly or indirectly, or may activate apoptosis). The protein is useful in 
treating, detecting, and/or preventing various pulmonary disorders, which include, but 

20 are not limited to, ARDS, emphysema, and cystic fibrosis. Protein, as well as, 

antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 

25 related to SEQ ID NO:76 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 

would-be cumbersome^ Accordingly r preferably excluded from the present invention 

are one or more polynucleotides comprising a nucleotide sequence described by the 

30 general formula of a-b, where a is any integer between 1 to 1489 of SEQ ID NO:76, b 
is an integer of 15 to 1503, where both a and b correspond to the positions of 
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nucleotide residues shown in SEQ ID NO:76, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 67 
5 In specific embodiments, polypeptides of the invention comprise the following 

amino acid sequence: LQIPPSSQSLGLKNADSSI (SEQ ID NO:286), GGPPESAPW 
LPAVLRAPVLTSRCASSDSEGPVWFCQPGSGPSSTEMSCHCILGPGSSCLCVL 
RGSMWTPSVPGWPQPAKETGASSCSVFSANNGSCPLPLHNHQRQASLDTGL 
SLEHVPGES YFYSPVG (SEQ ID NO:287), SSDSEGPVWFCQPGSGPSSTEMSC 
10 HCILGPGSSC (SEQ ID NO:288), WTPSVPGWPQPAKETGASSCSVFSANNG 
(SEQ ID NO:289), and/or QRQASLDTGL SLEHVPGES YF (SEQ ID NO:290). 
Polynucleotides encoding these polypeptides are also encompassed by the invention. 

It has been discovered that this gene is expressed primarily in human B cell 
lymphoma. 

15 Therefore; nucleic acids of the invention are useful as reagents for differential 

identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: immune or hematopoietic diseases 
and/or disorders, particularly B cell lymphoma. Similarly, polypeptides and 
antibodies directed to those polypeptides are useful to provide immunological probes 

20 for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the immune system, expression of this 
gene at significantly higher or lower levels may be detected in certain tissues or cell 
types (e.g., immune, hematopoietic, and cancerous and wounded tissues) or bodily 
fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken from an 

25 individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue from an individual not having the disorder. 

The tissue distribution in B-cell lymphoma suggests that the protein product of 

this clone-would be-useful-for diagnosis and treatment of immune or hematopoietic — 

diseases and/or disorders, particularly proliferative conditions. Morever, the 

30 expression of this gene product suggests a role in regulating the proliferation; 

survival; differentiation; and/or activation of hematopoietic cell lineages, including 
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blood stern cells. This gene product may be involved in the regulation of cytokine 
production, antigen presentation, or other processes suggesting a usefulness in the 
treatment of cancer (e.g. by boosting immune responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
5 product may be involved in immune functions. Therefore it may be also used as an 
agent for immunological disorders including arthritis, asthma, immunodeficiency 
diseases such as AIDS, leukemia, rheumatoid arthritis, granulomatous disease, 
inflammatory bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, 
hypersensitivities, such as T-cell mediated cytotoxicity; immune reactions to 

10 transplanted organs and tissues, such as host-versus-graft and graft- versus-host 
diseases, or autoimmunity disorders, such as autoimmune infertility, lense tissue 
injury, demyelination, systemic lupus erythematosis, drug induced hemolytic anemia, 
rheumatoid arthritis, Sjogren's disease, scleroderma and tissues. Moreover, the protein 
may represent a secreted factor that influences the differentiation or behavior of other 

15 blood cells, or that recruits hematopoietic cells to sites of injury. In addition, this gene 
product may have commercial utility in the expansion of stem cells and committed 
progenitors of various blood lineages, and in the differentiation and/or proliferation of 
various cell types. The uses include bone marrow cell ex- vivo culture, bone marrow 
transplantation, bone marrow reconstitution, radiotherapy or chemotherapy of 

20 neoplasia. The gene product may also be involved in lymphopoiesis, therefore, it can 
be used in immune disorders such as infection, inflammation, allergy, 
immunodeficiency etc. In addition, this gene product may have commercial utility in 
the expansion of stem cells and committed progenitors of various blood lineages, and - 
in the differentiation and/or proliferation of various cell types. Protein, as well as, 

25 antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 

available and accessible through sequence-databases— Some of these sequences are 

related to SEQ ID NO:77 and may have been publicly available prior to conception of 

30 the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
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would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 858 of SEQ ID NO:77, b 
is an integer of 15 to 872, where both a and b correspond to the positions of 
5 nucleotide residues shown in SEQ ID NO:77, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 68 

In specific embodiments, polypeptides of the invention comprise the following 
10 amino acid sequence: SSSLVLTIRSQTLFLASFIHSTSIFCALN (SEQ ID NO:291). 
Polynucleotides encoding these polypeptides are also encompassed by the invention. 

It has been discovered that this gene is expressed primarily in osteoarthritic 
cartilage. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
15 identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: osteoarthritis and other 
bone/cartilage disorders, particularly degenerative conditions. Similarly, polypeptides 
and antibodies directed to those polypeptides are useful to provide immunological 
probes for differential identification of these tissue(s) or cell type(s). For a number of 
20 disorders of the above tissues or cells, particularly of the skelatal system, expression 
of this gene at significantly higher or lower levels may be detected in certain tissues 
or cell types (e.g., skeletal, joint, autoimmune, and cancerous and wounded tissues) or 
bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken 
from an individual having such a disorder, relative to the standard gene expression 
25 level, i.e., the expression level in healthy tissue from an individual not having the 
disorder. 

The tissue distribution in osteoarthritic cartilage suggests that the protein 
product of this clone would be useful ^ 

of osteoarthritis. Moreover, the gene product is useful in the detection and treatment 
30 of disorders and conditions affecting the skeletal system, in particular osteoporosis, 

bone cancer, as well as, disorders afflicting connective tissues (e.g. arthritis, trauma, 
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tendonitis, chrondomalacia and inflammation), such as in the diagnosis or treatment 
of various autoimmune disorders such as rheumatoid arthritis, lupus, scleroderma, and 
dermatomyositis as well as dwarfism, spinal deformation, and specific joint 
abnormalities as well as chondrodysplasias (i.e. spondyloepiphyseal dysplasia 
5 congenita, familial osteoarthritis, Atelosteogenesis type II, metaphyseal 

chondrodysplasia type Schmid). Protein, as well as, antibodies directed against the 
protein may show utility as a tumor marker and/or immunotherapy targets for the 
above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 

10 available and accessible through sequence databases. Some of these sequences are 

related to SEQ ID NO:78 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 

15 are one or more polynucleotides, comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 559 of SEQ ID NO:78, b 
is an integer of 15 to 573, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:78, and where b is greater than or equal to a 
+ 14. 

20 

FEATURES OF PROTEIN ENCODED BY GENE NO: 69 

The gene encoding the disclosed cDNA is believed to reside on chromosome 
17. Accordingly, polynucleotides related to this invention are useful as a marker in 
linkage analysis for chromosome 17. 
25 It has been discovered that this gene is expressed primarily in fetal brain, 

pharynx carcinoma, and Hodgkin's lymphoma. 

Therefore, nucleic acids of the invention are useful as reagents for differential 

identification of the tissue(s) or-cell-type(s)-present- in a biological sample and for 

diagnosis of the following diseases and conditions: developmental and/or proliferative 
30 diseases and disorders, particularly pharynx carcinoma, and Hodgkin's lymphoma. 
Similarly, polypeptides and antibodies directed to those polypeptides are useful to 
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provide immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
digestive and immune systems, expression of this gene at significantly higher or 
lower levels may be detected in certain tissues or cell types (e.g., developmental, 
proliferative cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, 
plasma, urine, amniotic fluid, synovial fluid or spinal fluid) taken from an individual 
having such a disorder, relative to the standard gene expression level, i.e., the 
expression level in healthy tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
176 as residues: Tyr-30 to Ser-40. 

The tissue distribution in pharynx carcinoma and Hodgkin's lymphoma 
suggests that the protein product of this clone would be useful for diagnosis and 
treatment of immune and proliferative conditions. Moreover, expression within fetal 
tissue and other cellular sources marked by proliferating cells suggests this protein 
may play a role in'Qie regulation of cellular division, and may show utility in the 
diagnosis and treatment of cancer and other proliferative disorders. Similarly, 
developmental tissues rely on decisions involving cell differentiation and/or apoptosis 
in pattern formation. Dysregulation of apoptosis can result in inappropriate 
suppression of cell death, as occurs in the development of some cancers, or in failure 
to control the extent of cell death, as is believed to occur in acquired 
immunodeficiency and certain neurodegenerative disorders, such as spinal muscular 
atrophy (SMA). Therefore, the polynucleotides and polypeptides of the present 
invention are useful in treating, detecting, and/or preventing said disorders and 
conditions, in addition to other types of degenerative conditions. Thus this protein 
may modulate apoptosis or tissue differentiation and would be useful in the detection, 
treatment, and/or prevention of degenerative or proliferative conditions and diseases. 

Alternatively, the protein product of this clone is useful for the detection, 
treatment,-and/or preventionof neurode^ 

disorders, or inflammatory conditions which include, but are not limited to 
Alzheimer's Disease, Parkinson's Disease, Huntington's Disease, Tourette Syndrome, 
meningitis, encephalitis, demyelinating diseases, peripheral neuropathies, neoplasia, 
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trauma, congenital malformations, spinal cord injuries, ischemia and infarction, 
aneurysms, hemorrhages, schizophrenia, mania, dementia, paranoia, obsessive 
compulsive disorder, depression, panic disorder, learning disabilities, ALS, 
psychoses, autism, and altered behaviors, including disorders in feeding, sleep 
5 patterns, balance, and perception. In addition, elevated 

Expression of this gene product in regions of the brain suggests it plays a role 
in normal neural function. Potentially, this gene product is involved in synapse 
formation, neurotransmission, learning, cognition, homeostasis, or neuronal 
differentiation or survival. Protein, as well as, antibodies directed against the protein 
10 may show utility as a tumor marker and/or immunotherapy targets for the above listed 
tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:79 and may have been publicly available prior to conception of 

15 the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1495 of SEQ ID NO:79, b 

20 is an integer of 15 to 1509, where both a and b correspond to the positions of 

nucleotide residues shown in SEQ ID NO:79, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 70 

25 The translation product of this gene shares sequence homology with insulin- 

like growth factor binding protein. Moreover, the protein has homology to the human 
Slit-1 protein (See Genbank Accession No. gnllPIDId 1036 170 (AB017167)), which is 
thought to-play"an~integral"role~in~neuraJ develops 

the slit gene has been shown to play a critical role in CNS midline formation. Each 
30 Slit gene encodes a putative secreted protein, which contains conserved protein- 
protein interaction domains including leucine-rich repeats (LRR) and epidermal 
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growth factor (EGF)-like motifs, like that of the Drosophila protein. The Slit genes 
form an evolutionary conserved group in vertebrates and invertebrates, and the 
mammalian Slit proteins may participate in the formation and maintenance of the 
nervous and endocrine systems by protein-protein interactions. 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: the EGF-like domain: CCCRLGLSGPKC (SEQ ID NO:292); in 
addition to the following: RAFWGLGALQLLDLSANQLEAL (SEQ ID NO:293), 
HASGRRTGSADDGLQGRTGSGPPTAGAGGGGAAP (SEQ ID NO:294), 
VSAAAGARLAPRAPGAPAGCRPMRGCAARAAARKSLVPVLPAGWRSGPAA 
AARPGPRRLAHAPSAARSRAGPGAVARPLPRRHLAAAHGRGCGPAAARAGA 
GSGPGARRAARVPTAGRPPGTHVHTSGQSGAPRDPEGEALADTWAQTGQGD 
SSSNSSSSGRGRDQEGPRMGAAPPPPAPAVGGPLPVRPWSPSSAEPVLRPDAW 
(SEQ ID NO:295), 

TRPAAERAPRTTGSRDAQAAGLPPRVPGAGGLPPCGALPGR 
GLGRCCCCCCCeRLGLSGPKCRPGPRPRGPWAPRTAPRCARACREACQLSAL 
SLPAVPPGLSLRLRALLLDHNRVRALPPGAFAGAGALQRLDLRENGLHSVHV 
RAFWGLGALQLLDLSANQLEALAPGTFAPLRALRNLSLAGNRLARLEPAALG 
ALPLLRSLSLQDNELAALAPGLLGRLPALDALHLRGNPWGCGCALRPLCAWL 
RRHPLPASEAETVLCVWPGRLTLSPLTAFSDAAFSHCAQPLALRDLARGLHA 
RAGLLPRQPGFLPGAGLWAHRLPCAPPPPPHRRPPPAETVQTRTPIPTPTAVPR 
PRTRG APS A A AQA (SEQ ID NO:296), 

GCRPMRGCAARAAARKSLVPVLPAGWRSGP AAAARPGPRRLAHAPSA (SEQ 
ID NO: 297), PGAVARPLPRRHLAAAHGRGCG PAAARAGA (SEQ ID NO:298), 
S GQSG APRDPEGE AL ADT W AQTGQ (SEQ ID NO:299), 
PPAPAVGGPLPVRPWSPSSAEPV (SEQ ID NO:300), APRTTGSRD 
AQAAGLPPRVPGAGGLP (SEQ ID NO:301), GPRPRGPWAPRTAPRCARACRE 
(SEQ ID NO:302), AVPPGLSLRLRALLLDHNRVRALPPGAFAGA (SEQ ID 
- NO:303), LGAtQttDLSANQLEALAPGTFAP (SEQ-ID-NO:304)rPPGAFAGAG- 
ALQRLDLRENGLHSVHVRAFWGLGALQ (SEQ ID NO:305), RNLSLAGNRLA 
RLEP A ALG ALPLLRS LS (SEQ ID NO:306), LPALDALHLRGNPWGCGCALRP 
LCAW (SEQ ID NO:307), TVLCVWPGRLTLSPLTAFSDAAFSHCAQPLALRD 
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(SEQ ID NO:308), LHARAGLLPRQPGFLPGAGLWAHR (SEQ ID NO:309), 
and/or TVQTRTPIPTPTAVPRPRTRGAPS (SEQ ID NO:310). Polynucleotides 
encoding these polypeptides are also encompassed by the invention. 

It has been discovered that this gene is expressed primarily in a breast cancer 
5 cell line, MDA36. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: neural, reproductive, and 
proliferative diseases and/or disorders, particularly breast cancer and degenerative 
10 conditions. Similarly, polypeptides and antibodies directed to those polypeptides are 
useful to provide immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the reproductive system, expression of this gene at significantly higher or lower levels 
may be detected in certain tissues or cell types (e.g., neural, reproductive, and 
15 proliferative cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, 
plasma, urine, synovial fluid or spinal fluid) taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
20 177 as residues: Met-1 to Arg-10, Arg-64 to Ala-71, Gly-124 to Gly-13 1, Pro-189 to 
Arg- 1 94, Val-223 to Gly-228. 

The tissue distribution in a breast cancer cells and tissues and homology to 
insulin-like -growth factor binding protien suggests that the protein product of this 
clone would be useful for diagnosis and treatment of breast cancer, and other forms of 
25 cancer. Moreover, the homology to the conserved human slit-1 protein suggests that 
the protein is useful in the treatment, diagnosis, and/or prevention of neural disorders, 
particularly developmental and degenerative conditions. Similarly, the protein is 

useful for the treatment-and/or diagnosis of neurodegenerative disease states; ; ~ 

behavioral disorders, or inflammatory conditions which include, but are not limited to 
30 Alzheimer's Disease, Parkinson's Disease, Huntington's Disease, Tourette Syndrome, 
meningitis, encephalitis, demyelinating diseases, peripheral neuropathies, neoplasia, 



WO 99/47540 



PCTAJS99/05804 



128 

trauma, congenital malformations, spinal cord injuries, ischemia and infarction, 
aneurysms, hemorrhages, schizophrenia, mania, dementia, paranoia, obsessive 
compulsive disorder, depression, panic disorder, learning disabilities, ALS, 
psychoses, autism, and altered behaviors, including disorders in feeding, sleep 
patterns, balance, and perception. In addition, elevated 

Expression of this gene product in regions of the brain suggests it plays a role 
in normal neural function. Potentially, this gene product is involved in synapse 
formation, neurotransmission, learning, cognition, homeostasis, or neuronal 
differentiation or survival. Protein, as well as, antibodies directed against the protein 
may show utility as a tumor marker and/or immunotherapy targets for the above listed 
tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 80 and may have been publicly available prior to conception of 
the present invention. Preferably, suchrelated polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1095 of SEQ ID NO: 80, b 
is an integer of 15 to 1 109, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:80, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 71 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: HASGRPDRSSAPIGNSGLPCPDLEPLGGLQSKCRLCAPTE 
ARGL WS RS LCS DRCDTWRS (SEQ ID NO:31 1), and/or GLPCPDLEPLGGLQSK 

-CRLCAPTEARGLW-(SEQ-^ 

polypeptides are also encompassed by the invention. This gene also maps to 
chromosome 1 , and therefore can be used in linkage analysis as a marker for 
chromosome 1. 
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It has been discovered that this gene is expressed primarily in salivary gland 
and colon carcinoma. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
5 diagnosis of the following diseases and conditions: colon carcinoma and other 

digestive system or gastrointestinal diseases and/or disorders. Similarly, polypeptides 
and antibodies directed to those polypeptides are useful to provide immunological 
probes for differential identification of the tissue(s) or cell type(s). For a number of 
disorders of the above tissues or cells, particularly of the digestive system, expression 
10 of this gene at significantly higher or lower levels may be detected in certain tissues 
or cell types (e.g., digestive system, gastrointestinal, metabolic, and cancerous and 
wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, chyme, bile, 
synovial fluid or spinal fluid) taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy 
15 tissue from an individual hot haying the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
178 as residues: Val-34 to Leu-39, Ser-64 to Cys-74, Ser-86 to Ser-95, Arg-128 to 
Ala-136. 

The tissue distribution in salivary gland and colon carcinoma suggests that the 
20 protein product of this clone would be useful for the treatment and diagnosis colon 
cancer and other digestive system diseases and/or disorders, such as ulcers, and other 
proliferative conditions. Protein, as well as, antibodies directed against the protein 
may show utility as a tumor marker and/or immunotherapy targets for the above listed 
tissues. 

25 Many polynucleotide sequences, such as EST sequences, are publicly 

available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 81 and may have been publicly available prior to conception of 

the-present invention. r Preferably^such related polynucleotides are specifically 

excluded from the scope of the present invention. To list every related sequence 

30 would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
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general formula of a-b, where a is any integer between 1 to 793 of SEQ ID NO:81, b 
is an integer of 15 to 807, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:81, and where b is greater than or equal to a 
+ 14. 

5 

FEATURES OF PROTEIN ENCODED BY GENE NO: 72 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: QEWESELGERRKPLQA (SEQ ID NO:313). Polynucleotides 
encoding these polypeptides are also encompassed by the invention. 
10 It has been discovered that this gene is expressed primarily in 6 week old 

human embryos. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: embryological defects; aberrant 
15 development; aberrant cellular proliferation (e.g. cancers), and other developmentally 
related or proliferative diseases and/or disorders. Similarly, polypeptides and 
antibodies directed to those polypeptides are useful to provide immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the developing human embryo, expression 
20 of this gene at significantly higher or lower levels may be detected in certain tissues 
or cell types (e.g., developmental, and cancerous and wounded tissues) or bodily 
fluids (e.g., lymph, serum, plasma, amniotic fluid, urine, synovial fluid or spinal fluid) 

taken from an individual having-such-a disorder-,-relative-to the-standard-gene 

expression level, i.e., the expression level in healthy tissue from an individual not 
25 having the disorder. 

The tissue distribution in 6 week old human embryos suggests that the protein 
product of this clone would be useful for the diagnosis and/or treatment of defects in 

embryonic-development-Elevated expression of this gene product in early 6~week 

human embryos suggests that this gene product plays a critical role in normal human 
30 development. Alternatively, this gene product may be involved in the pattern of 
cellular proliferation that accompanies early embryogenesis. Thus, aberrant . 
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Expression of this gene product in tissues - particularly adult tissues - may 
correlate with patterns of abnormal cellular proliferation, such as found in various 
cancers. Moreover, this protein may play a role in the regulation of cellular division, 
and may show utility in the diagnosis and treatment of cancer and other proliferative 
5 disorders. Similarly, developmental tissues rely on decisions involving cell 

differentiation and/or apoptosis in pattern formation. Dysregulation of apoptosis can 
result in inappropriate suppression of cell death, as occurs in the development of some 
cancers, or in failure to control the extent of cell death, as is believed to occur in 
acquired immunodeficiency and certain neurodegenerative disorders, such as spinal 

10 muscular atrophy (SMA). Therefore, the polynucleotides and polypeptides of the 
present invention are useful in treating, detecting, and/or preventing said disorders 
and conditions, in addition to other types of degenerative conditions. Thus this protein 
may modulate apoptosis or tissue differentiation and would be useful in the detection, 
treatment, and/or prevention of degenerative or proliferative conditions and diseases. 

15 Protein, as well as; -antibodies directed against the protein may show utility as a tumor 
marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:82 and may have been publicly available prior to conception of 

20 the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1029 of SEQ ID NO:82, b 

25 is an integer of 15 to 1043, where both a and b correspond to the positions of 

nucleotide residues shown in SEQ ID NO:82, and where b is greater than or equal to a 
+ 14. 



FEATURES OF PROTEIN ENCODED BY GENE NO: 73 

30 In specific embodiments, polypeptides of the invention comprise the following 

amino acid sequence: CQSSNLIFFQFVNILFNLMMDILVDFSITKMPINS1FSLYF 
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CYEII (SEQ ID NO:314). Polynucleotides encoding these polypeptides are also 
encompassed by the invention. 

It has been discovered that this gene is expressed primarily in 6 week old 
human embryo. 

5 Therefore, nucleic acids of the invention are useful as reagents for differential 

identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: abnormal embryonic 
development; abnormal cellular proliferation; developmental defects, and other 
developmental^ related or proliferative diseases and/or conditions. Similarly, 
10 polypeptides and antibodies directed to those polypeptides are useful to provide 

immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the developing 
human embryo, expression of this gene at significantly higher or lower levels may be 
detected in certain tissues or cell types (e.g., developmental, and cancerous and 
1 5 wounded tissues) or bodily fluids <e.g:, lymph, serum, plasma, amniotic fluid, urine, 
synovial fluid or spinal fluid) taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy 
tissue from an individual not having the disorder. 

The tissue distribution in 6 week old human embryo suggests that the protein 
20 product of this clone would be useful for the diagnosis and treatment of disorders of 
human embryonic development. Expression of this clone in developing embryos 
suggests that it plays a critical role in early human development. Alternatively, it may 

be. involved in key cellular-proliferation events that occur during embryogenesis. -— 

Therefore misexpression of this gene in adult tissues may lead to abnormal patterns of 
25 cellular proliferation and cancer. Moreover, expression within embryonic tissue and 
other cellular sources marked by proliferating cells suggests this protein may play a 
role in the regulation of cellular division, and may show utility in the diagnosis and 

treatment of-caneer-and other-pro^ — 

rely on decisions involving cell differentiation and/or apoptosis in pattern formation. 
30 Dysregulation of apoptosis can result in inappropriate suppression of cell death, as 
occurs in the development of some cancers, or in failure to control the extent of cell 
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death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SMA). Therefore, the 
polynucleotides and polypeptides of the present invention are useful in treating, 
detecting, and/or preventing said disorders and conditions, in addition to other types 
5 of degenerative conditions. Thus this protein may modulate apoptosis or tissue 

differentiation and would be useful in the detection, treatment, and/or prevention of 
degenerative or proliferative conditions and diseases. Protein, as -well as, antibodies 
directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

10 Many polynucleotide sequences, such as EST sequences, are publicly 

available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:83 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 

1 5 would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1 159 of SEQ ID NO: 83, b 
is an integer of 15 to 1 173, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:83, and where b is greater than or equal to a 

20 + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 74 

In specific embodiments, polypeptides of the invention comprise the following 

amino acid sequence: GP VWLFCFLTLCRKPS QLFS QENS CMD V AG G VTTCLPP 
25 WFSRGAPAQMSQWPPSSDHGAVRAGRDSRVGPVQPSHLTCEGGKEEREKNK 

KAEVNPPTGMGLANRIPRDDITLKLRNQGKLRTKENRTQSAKRHP (SEQ ID 

NO:315), VACKPENRTKTHFASSPACDGHALGGQVGFAICFLSCLFPPM (SEQ 
ID-NO:3 1 6),.and/or SHPMPNTPQKQLLFSEDNELLVSLRTGRKPTLQAALRVTG - 

(SEQ ID NO:317). Polynucleotides encoding these polypeptides are also 
30 encompassed by the invention. 
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It has been discovered that this gene is expressed primarily in pleural cancer 
and endometrial tumors, and, to a lesser extent, in bone marrow & apoptotic T cells. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: pleural cancer; endometrial 
tumors; hematopoietic disorders; immune dysfunction. Similarly, polypeptides and 
antibodies directed to those polypeptides are useful to provide immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the lungs and immune system, expression 
of this gene at significantly higher or lower levels may be detected in certain tissues 
or cell types (e.g., immune, hematopoietic, reproductive, and cancerous and wounded 
tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal 
fluid) taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue from an individual not 
having the disorder; 

The tissue distribution in pleural cancer and endometrial tumors indicates that 
the protein products of this clone are useful for the diagnosis and treatment of various 
reproductive cancers, including pleural cancer and endometrial tumors. In addition, 

Expression of this gene product within T cells & bone marrow suggests that it 
may play a role in normal hematopoiesis. Therefore, this gene product may also be 
useful in the diagnosis and/or treatment of a variety of hematopoietic disorders, 
including defects in immune surveillance, inflammation, impaired immune function, 

— and-T-cell lymphomasT-Use ofthis gene product may be appropriate in situations 

designed to affect the proliferation, survival, and/or differentiation of various 
hematopoietic cell lineages, including blood stem cells. 

Moreover, this protein may play a role in the regulation of cellular division, 
and may show utility in the diagnosis and treatment of cancer and other proliferative 
disordersrSimilarlyrde 
differentiation and/or apoptosis in pattern formation. Dysregulation of apoptosis can 
result in inappropriate suppression of cell death, as occurs in the development of some 
cancers, or in failure to control the extent of cell death, as is believed to occur in 
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acquired immunodeficiency and certain neurodegenerative disorders, such as spinal 
muscular atrophy (SMA). Therefore, the polynucleotides and polypeptides of the 
present invention are useful in treating, detecting, and/or preventing said disorders 
and conditions, in addition to other types of degenerative conditions. Thus this protein 
may modulate apoptosis or tissue differentiation and would be useful in the detection, 
treatment, and/or prevention of degenerative or proliferative conditions and diseases. 
Protein, as well as, antibodies directed against the protein may show utility as a tumor 
marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 84 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide- sequence described by the 
general formula of a-b, where a is any integer between 1 to 1547 of SEQ ID NO:84, b 
is an integer of 15 to 1561, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:84, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 75 

The translation product of this gene shares low sequence homology with dreg- 
2, a gene product originally identified in Drosophila that shows an oscillating pattern 
of expression tied into a circadian clock rhythm. 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: 

AHRLQIRIXTWDVKDTLLRLRHPLGEAYATKARAHGLEV 
EPSALEQGFRQAYRAQSHSFPNYGLSH * 
QAVAPIAEQLYKDFSHPCTWQVLDGAEDTLRECRTRGLRLAVISNFDRRLEGI 
LXGLGLREHFDFVLTSEAAGWPKPDPRIFQEALRLAHMEPVVAAHVGDNYL 
CDYQGPRAVGMHSFLVVGPQALDPVVRDSVPKEHILPSLAHLLPALDCLEGS 
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TPGL (SEQ ID N 0:319), 

EGDPRGRPRPRPLGPPPQLTLPTALXDILRQVRAPGLRLSRA 
LEVGRKGSPIFKIQIYL (SEQ ID NO:318), IRLLTWDVKDTLLRLRHPLGEAYA 
TKA (SEQ ID NO:320), LEQGFRQAYRAQSHSFPNYGLSHG (SEQ ID NO:321), 
HLAGVQDAQAVAPIAEQLYKDFSHPC (SEQ ID NO:322), VLDGAEDTLRECR 
TRGLRLAVIS (SEQ ID NO:323), REHFDFVLTSEAAGWPKPDPRIFQEA (SEQ 
ID NO:324), EPVVAAHVGDNYLCDYQGPRAVGMHSFL (SEQ ID NO:325), 
and/or VVRDSVPKEHILPSLAHLLPALD (SEQ ID NO:326). Polynucleotides 
encoding these polypeptides are also encompassed by the invention. 

It has been discovered that this gene is expressed primarily in tumors of the 
pancreas & thymus and to a lesser extent in a variety of fetal tissues, including fetal 
brain, liver, spleen, and kidney. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: pancreatic cancer; thymic cancer; 
disorders of fetal development; abnormal cellular proliferation; hematopoietic 
disorders. Similarly, polypeptides and antibodies directed to those polypeptides are 
useful to provide immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the pancreas and immune system, expression of this gene at significantly higher or 
lower levels may be detected in certain tissues or cell types (e.g., developmental, 
metabolic, immune, hematopoietic, and cancerous and wounded tissues) or bodily 
fluids (e.g., lymph, serum, plasma, amniotic fluid, urine, synovial fluid or spinal fluid) - 
taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue from an individual not 
having the disorder. 

The tissue distribution in proliferative and developmental cells and tissues 

- indicates that the-protein products of this clone are useful for the diagnosis and 

treatment of cancers, particularly pancreatic and thymic cancer. Expression of this 
gene product within various fetal tissues also indicates that it is useful in the diagnosis 
and/or treatment of human developmental disorders. Taken together, the observation 
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that this gene product is expressed in cancers and in fetal tissues indicates that it plays 
a role in proliferation and/or differentiation events that are associated with early 
development. Misexpression of this gene product in adult tissues, therefore, may 
directly contribute to abnormal cellular, proliferation and/or dedifferentiation that 
accompanies cancer. Finally, 

Moreover, the expression of this gene product in fetal liver/spleen also 
suggests that it plays a role in hematopoiesis, and is useful in the diagnosis and/or 
treatment of a variety of disorders of the immune system. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:85 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1419 of SEQ ID NO:85, b 
is an integer of 15 to 1433, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO: 85, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 76 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: IRKLGPGLAPCSCRSGQVFPRV (SEQ ID NO:327). 
Polynucleotides encoding these polypeptides are also encompassed by the invention. 

It has been discovered that this gene is expressed primarily in frontal cortex, 
^particularly derived from epileptic patients. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: epilepsy; neurodegenerative 
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diseases and disorders, particularly learning disorders. Similarly, polypeptides and 
antibodies directed to those polypeptides are useful to provide immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the brain, CNS, and/or PNS, expression of 
5 this gene at significantly higher or lower levels may be detected in certain tissues or 
cell types (e.g., neural, and cancerous and wounded tissues) or bodily fluids (e.g., 
lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken from an individual 
having such a disorder, relative to the standard gene expression level, i.e., the 
expression level in healthy tissue from an individual not having the disorder. 

10 The tissue distribution in frontal cortex tissue suggests that the protein product 

of this clone would be useful for the diagnosis and/or treatment of disorders of the 
brain and nervous system, particularly epilepsy. Moreover, the expression of this gene 
product suggests that it may play a role in various critical processes of the nervous 
system, including nerve survival, pathfinding, signal conductance, and/or synapse 

15 formation. It may have effects on various processes including homeostasis, learning, 
motor function, language, etc. Potentially, this gene product is involved in synapse 
formation, neurotransmission, learning, cognition, homeostasis, or neuronal 
differentiation or survival. Protein, as well as, antibodies directed against the protein 
may show utility as a tumor marker and/or immunotherapy targets for the above listed 

20 tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 86 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
25 excluded from the scope of the present invention. To list every related sequence 

would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 

- — general formula.of a-b,-where a-is any integer-between- Ho 1-363 of SEQ ID N0:86rb 

is an integer of 15 to 1377, where both a and b correspond to the positions of 
30 nucleotide residues shown in SEQ ID NO:86, and where b is greater than or equal to a 
+ 14. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 77 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: 

KPLRMARPGGPEHNEYALVSAWHSSGSYLDSEGLRHQDD 
FDVSLLVCHCAAPFEEQGEAERHVLRLQFFVVLTSQRELFPRLTADMRRFRK 
PPRLPPEPEAPGSSAGSPGEASGLILAPGPAPLFPPLAAEVGMARARLAQLVRL 
AGGHCRRDTLWKRLFLLEPPGPDRLRLGGRLALAELEELLEAVHAKSIGDIDP 
QLDCFLSMTVSWYQSLIKVLLSRFPRAVAISKAQTWELSTWLR (SEQ ID 
NO:328), ARGTLELPTPLIAAHQLYNYVADHASSYHM (SEQ ID NO:329), 
SHCEWPGQG AQNTTSMPWCRHGTVLAPTWTLRDFDTR (SEQ ID NO:330), 
PLTTVSHLCPL 

SLRVFTSHLDITAGHSHRDDTWVPIPALPLKHLRPPSSPFALGPWVSHPLMRW 

VQKLSHLHSNPGTGFSMGGKSAEKLKC (SEQ ID NO:331), STAARGAPGPGR 

AGGTPRSSPCQIHWGHRPPAGLLPIHDGLLVPEPDQSSPKPLPQSCRHFQSPDL 

GTQYLVALNQKFTDCSALVFWTPLRKDVSEVVFREALPVQPQDTRSPPAQLV 

STYHHLESVINTACFTLLDPPPLKGVDWTTECHCSLNHGPTRLPARGRTDQPF 

WAPGQARH (SEQ ID N O : 3 3 2 ) , 

HQRLCNYVLRVCCPSLAAGTALPKHPQPLTHPGL 

QRVRSTPRTPWALLGYSFRPPW (SEQ ID NO:333), 
PGGPEHNEYALVSAWHSS GSYLDSEGLR (SEQ ID NO:334), 
D VSLL VCHC A APFEEQGE AERH VLR (SEQ ID NO:335), 
RLTADMRRFRKPPRLPPEPEAPGSSAGS (SEQ ID NO:336), GEASGLI 
LAPGPAPLFPPLA AE VGM (SEQ ID NO:337), 

TLWKRLFLLEPPGPDRLRLGGRL (SEQ ID NO:338), and/or 
LAELEELLEAVHAKSIGDIDPQLDCFLS (SEQ ID NO:339). Polynucleotides 
encoding these polypeptides are also encompassed by the invention. 

It-has-been-discovered that-this-gene-is expressed primafily in fetal liver/spleen " 

and leukocytes, and to a lesser extent in a colon adenocarcinoma cell line. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
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diagnosis of the following diseases and conditions: hematopoietic disorders; immune 
dysfunction; colon cancer; colorectal adenocarcinoma. Similarly, polypeptides and 
antibodies directed to those polypeptides are useful to provide immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the immune system and colon, expression 
of this gene at significantly higher or lower levels may be detected in certain tissues 
or cell types (e.g., hematopoietic, immune, gastrointestinal, and cancerous and 
wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or 
spinal fluid) taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue from an individual 
not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
184 as residues: Leu- 16 to Ser-23, Ser-38 to Pro-43, Gly-53 to Leu-60. 

The tissue distribution in colon adenocarcinoma suggests that the protein 
product of this clone would be useful for the diagnosis and/or treatment of 
gastrointestinal diseases and/or disorders, particularly proliferative conditions. 
Expression of this gene product in fetal and proliferative cells and tissues suggests 
that it may be a marker cancers, and that it's misregulated expression may in fact 
contribute to the development or progression of the types of cancers dictated by its 
expression. 

Similarly, the expression of this gene product in fetal liver/spleen - a primary 
site of early hematopoiesis - taken together with its expression in peripheral blood 
leukocytes suggests that this gene product may play a role in a variety of 
hematopoietic processes, including the survival, proliferation, activation, and/or 
differentiation of all blood cell lineages, including the totipotent hematopoietic stem 
cell. Such a gene product may therefore play a role in a variety of hematopoietic 
disorders including inflammation; immune dysfunction; defects in immune 
- -surveillance; and-hematopoiet^ 

tissues rely on decisions involving cell differentiation and/or apoptosis in pattern 
formation. Dysregulation of apoptosis can result in inappropriate suppression of cell 
death, as occurs in the development of some cancers, or in failure to control the extent 
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of cell death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SMA). 

Therefore, the polynucleotides and polypeptides of the present invention are 
useful in treating, detecting, and/or preventing said disorders and conditions, in 
5 addition to other types of degenerative conditions. Thus this protein may modulate 
apoptosis or tissue differentiation and would be useful in the detection, treatment, 
and/or prevention of degenerative or proliferative conditions and diseases. Protein, as 
well as, antibodies directed against the protein may show utility as a tumor marker 
and/or immunotherapy targets for the above listed tissues. 

10 Many polynucleotide sequences, such as EST sequences, are publicly 

available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 87 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 

15 would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1701 of SEQ ID NO:87, b 
is an integer of 15 to 1715, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:87, and where b is greater than or equal to a 

20 + 14. 



FEATURES OF PROTEIN ENCODED BY GENE NO: 78 

The gene encoding the disclosed cDNA is believed to reside on chromosome 
20. Accordingly, polynucleotides related to this invention are useful as a marker in 
25 linkage analysis for chromosome 20. 

It has been discovered that this gene is expressed primarily in brain. 
Therefore, nucleic acids of the invention are useful as reagents for differential 

~ ~ identification"of the tissue(s) or cell type(s)~pres<^ 

diagnosis of the following diseases and conditions: neurodegenerative diseases and/or 
30 disorders. Similarly, polypeptides and antibodies directed to those polypeptides are 
useful to provide immunological probes for differential identification of the tissue(s) 
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or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the central nervous system, expression of this gene at significantly higher or lower 
levels may be detected in certain tissues or cell types (e.g., neural, and cancerous and 
wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or 
spinal fluid) taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue from an individual 
not having the disorder. This gene is believed to reside on chromosome 20, D20S1 1 1- 
D20S195. Polynucleotides corresponding to this gene are useful, therefore, as 
chromosome markers. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
185 as residues: Met-1 to Tyr-6, Thr-38 to Ala-44. 

The tissue distribution in brain tissue indicates that the protein products of this 
clone are useful for diagnosis and treatment of disorders of the central nervous 
system. Moreover, the protein product of this clone is useful for the detection, 
treatment, and/or prevention of neurodegenerative disease states, behavioral 
disorders, or inflammatory conditions which include, but are not limited to 
Alzheimer's Disease, Parkinson's Disease, Huntington's Disease, Tourette Syndrome, 
meningitis, encephalitis, demyelinating diseases, peripheral neuropathies, neoplasia, 
trauma, congenital malformations, spinal cord injuries, ischemia and infarction, 
aneurysms, hemorrhages, schizophrenia, mania, dementia, paranoia, obsessive 
compulsive disorder, depression, panic disorder, learning disabilities, ALS, 
psychoses, autism, and altered behaviors, including disorders in feeding, sleep 
patterns, balance, and perception. 

In addition, elevated expression of this gene product in regions of the brain 
suggests it plays a role in normal neural function. Potentially, this gene product is 
involved in synapse formation, neurotransmission, learning, cognition, homeostasis, 
or neuronal differentiation or survival. Protein, as well as, antibodies directed against 
the protein may~show utility as~a~tumor marker and^ortmmunotherapy targets for the" 
above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
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related to SEQ ID NO:88 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
5 are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 403 of SEQ ID NO:88, b 
is an integer of 15 to 417, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:88, and where b is greater than or equal to a 
+ 14. 

10 

FEATURES OF PROTEIN ENCODED BY GENE NO: 79 

When tested against U937 cell lines, supernatants removed from cells 
containing this gene activated the GAS (gamma activating sequence) promoter 
element. Thus, it is likely that this gene activates myeloid cells, and to a lesser extent, 

15 other immune and hematopoietic, cells or cell types, through the JAK-STAT signal 
transduction pathway. GAS is a promoter element found upstream of many genes 
which are involved in the Jak-STAT pathway. The Jak-STAT pathway is a large, 
signal transduction pathway involved in the differentiation and proliferation of cells. 
Therefore, activation of the Jak-STAT pathway, reflected by the binding of the GAS 

20 element, can be used to indicate proteins involved in the proliferation and 
differentiation of cells. 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: FQLYFNPELIFKHFQIWRLITNFLFFGPVGFNFLFNMIFLY 
RYCRMLEEGSFRGRTADFVFMFLFGGFLMTLFGLFVSLVFLGQAFTIMLVYV 



25 WSRXNPYVRMNFFGLLNFQAPFLPWVLMGFSLLLGNSIIVDLLGIAVGHIYFF 
LEDVFPNQPGGIRILKTPSILKAIFDTPDEDPNYNPLPEERPGGFAWGEGQ SEQ 
ID N O : 3 4 0 ) , 




QLYFNPELIFKHFQIWRLITNFLFFGPVGFNFLFNMIFLYRYCRMLEEGSFRGR 
30 TADFVF (SEQ ID NO:341), LIFKHFQIWRLITNFLFFGPVGF (SEQ ID NO:342), 
FLYRYCRMLEEGSFRGRTADFVFMF (SEQ ID NO:343), LVFLGQAFTIMLVYV 
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WSRXNPYV (SEQ ID NO:344), VLMGFSLLLGNSIIVDLLGIA (SEQ ID NO:345), 
NQPGGIRILKTPSILKAIFDTPDED (SEQ ID NO:346), RLEYLQIPPVSRAYTTAC 
VLTTAAVQLE (SEQ ID NO:347), and/or RLITNFLFTGPVGFNFLFNMIFLYRYC 
RMLE (SEQ ID NO:348). Polynucleotides encoding these polypeptides are also 
5 encompassed by the invention. The gene encoding the disclosed cDNA is believed to 
reside on chromosome 17. Accordingly, polynucleotides related to this invention are 
useful as a marker in linkage analysis for chromosome 17. 

It has been discovered that this gene is expressed primarily in smooth muscle, 
fetal brain, fetal liver and to a lesser extent in activated macrophage, colon cancer. 

10 Therefore, nucleic acids of the invention are useful as reagents for differential 

identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: developmental diseases, immune- 
related diseases, neural disorders, and vascular diseases and conditions. Similarly, 
polypeptides and antibodies directed to those polypeptides are useful to provide 

15 immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the immune system 
and central nervous system, expression of this gene at significantly higher or lower 
levels may be detected in certain tissues or cell types (e.g., developmental, vascular, 
immune, and cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, 

20 plasma, amniotic fluid, urine, synovial fluid or spinal fluid) taken from an individual 
having such a disorder, relative to the standard gene expression level, i.e., the 
expression level in healthy tissue from an individual not having the disorder. 

The tissue distribution in fetal liver, macrophage, and fetal brain indicates that 
the protein products of this clone are useful for treating and diagosis of immune 

25 system-related diseases and CNS diseases. Moreover, the protein product of this clone 
is useful for the treatment and diagnosis of hematopoietic related disorders such as 
anemia, pancytopenia, leukopenia, thrombocytopenia or leukemia since stromal cells 
are iri^iTaiinrnhe ^ uses include 

bone marrow cell ex- vivo culture, bone marrow transplantation, bone marrow 

30 reconstitution, radiotherapy or chemotherapy of neoplasia. The gene product may also 
be involved in lymphopoiesis, therefore, it can be used in immune disorders such as 



WO 99/47540 



PCT/US99/05804 



infection, inflammation, allergy, immunodeficiency etc. In addition, this gene product 
may have commercial utility in the expansion of stem cells and committed 
progenitors of various blood lineages, and in the differentiation and/or proliferation of 
various cell types. Alternatively, the protein is useful in the detection, treatment, 
5 and/or prevention of vascular conditions, which include, but are not limited to, 
microvascular disease, vascular leak syndrome, aneurysm, stroke, atherosclerosis, 
arteriosclerosis, or embolism. 

Moreover, the expression within fetal tissue and other cellular sources marked 
by proliferating cells, combined with the GAS biological activity, suggests this 

10 protein may play a role in the regulation of cellular division, and may show utility in 
the diagnosis and treatment of cancer and other proliferative disorders. Similarly, 
developmental tissues rely on decisions involving cell differentiation and/or apoptosis 
in pattern formation. Dysregulation of apoptosis can result in inappropriate 
suppression of cell death, as occurs in the development of some cancers, or in failure 

15 to control the extent of cell death, -jas is believed to occur in acquired 

immunodeficiency and certain neurodegenerative disorders, such as spinal muscular 
atrophy (SMA). Therefore, the polynucleotides and polypeptides of the present 
invention are useful in treating, detecting, and/or preventing said disorders and 
conditions, in addition to other types of degenerative conditions. Thus this protein 

20 may modulate apoptosis or tissue differentiation and would be useful in the detection, 
treatment, and/or prevention of degenerative or proliferative conditions and diseases. 
Protein, as well as, antibodies directed against the protein may show utility as a tumor 
marker and/or immunotherapy targets for the above listed tissues. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 

25 immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 

related to SEQ ID NO:89 and may have been publicly-available-prior to conception of 

the present invention. Preferably, such related polynucleotides are specifically 

30 excluded from the scope of the present invention. To list every related sequence 

would be cumbersome. Accordingly, preferably excluded from the present invention 
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are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1 153 of SEQ ID NO:89, b 
is an integer of 15 to 1 167, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:89, and where b is greater than or equal to a 
5 + 14. 



FEATURES OF PROTEIN ENCODED BY GENE NO: 80 

The translation product of this gene shares sequence homology with 
proacrosin binding proteins (sp32) from non-human mammalian species. The binding 

10 of sp32 to proacrosin may be involved in packaging the acrosin zymogen into the 
acrosomal matrix. See, for example, J Biol Chem. 1994 Apr 1; 269(13): 10133- 
10140, incorporated herein by reference. Accordingly, the inventors have termed the 
translation product of this gene human sp32 or "h-sp32 M . Contact of cells with 
supernatant expressing the product of this gene has been shown to increase the 

15 permeability of the- plasma membrane* of PMN to calcium. Thus it is likely that the 

product of this gene is involved in a signal transduction pathway that is initiated when 
the product binds a receptor on the surface of the plasma membrane of both 
neutrophils, and to a lesser extent in other immune and hematopoietic cells'. Thus, 
polynucleotides and polypeptides have uses which include, but are not limited to, 

20 activating 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: HAS AGPDGSSPA (SEQ ID NO:349), 
ELLLEKPKPWQPPAAAPHRALLVLCYSIVENTCIITFTAKAWKYM 
KSVCDSLGRRHMSTCALCDFCSLKLEQCHSEASLQRQQCDTSHKTPFAAPCL 
25 PPRACPSATR (SEQ ID NO:350), 

LPGWGFPTKICDTDYIQYPNYCSFKSQQCLMR 

NRNRKVSRMRCLQNETYSALSPGKSEDVVLRWSQEFSTLTLGQFG (SEQ ID 
NO:351),- SRVLLPAEPPLRVPLLALPVSAPLPACVLVSAPACAPLLAPACAL 

ALAPGFPGTRRIVGALPRCC (SEQ ID NO:352), LLVLCYSIVENTCIITPTAK 
30 AWKYMEEEILGFGKS (SEQ ID NO:353), and/or LKLEQCHSEASLQRQQC 

DTSHKTPFA (SEQ ID NO:354). Polynucleotides encoding these polypeptides are 
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also encompassed by the invention. The gene encoding the disclosed cDNA is 
believed to reside on chromosome 12. Accordingly, polynucleotides related to this 
invention are useful as a marker in linkage analysis for chromosome 12. 

It has been discovered that this gene is expressed primarily in testis. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: reproductive disorders. Similarly, 
polypeptides and antibodies directed to those polypeptides are useful to provide 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the reproductive 
diseases, expression of this gene at significantly higher or lower levels may be 
detected in certain tissues or cell types (e.g., reproductive, testis, prostate, epidiymus, 
and cancerous and wounded tissues) or bodily fluids (e.g., lymph, seminal fluid, 
serum, plasma, urine, synovial fluid or spinal fluid) taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression 
level in healthy tissue from an individual not having the disorder. This gene is 
believed to map to chromosome 12 and is thought to be useful as a chromosome 
marker. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
187 as residues: Asp-27 to Ser-32, Pro-52 to Thr-58, Arg-63 to Asn-70, Gln-78 to 
Gly-83, Thr-107 to Asn-113, Thr-160 to Val-176, Ser-188 to Gly-241, Leu-248 to 
Pro-265, Tyr-302 to Gly-314. 

The tissue distribution in testis, combined with the specific homology to the 
sp32 protein indicates that the protein products of this clone are useful for the 
diagnosis, treating, and/or prevention of reproductive diseases and/or disorders. 
Moreover, polynucleotides and polypeptides corresponding to this gene are useful for 
the treatment and diagnosis of conditions concerning proper testicular function (e.g. 
_ endocrine Junction, sperm-maturation),as well-as cancer.^FhereforeHhis gene-product — 
is useful in the treatment of male infertility and/or impotence. This gene product is 
also useful in assays designed to identify binding agents, as such agents (antagonists) 
are useful as male contraceptive agents. 
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Similarly, the protein is believed to be useful in the treatment and/or diagnosis 
of testicular cancer. The testes are also a site of active gene expression of transcripts 
that may be expressed, particularly at low levels, in other tissues of the body. 
Therefore, this gene product may be expressed in other specific tissues or organs 
5 where it may play related functional roles in other processes, such as hematopoiesis, 
inflammation, bone formation, and kidney function, to name a few possible target 
indications. The protein is useful in application and utility as a contraceptive, either 
directly or indirectly. Based upon the detected calcium flux activity, the protein may 
also be useful as an effect treatment for infertility (i.e. for inhibiting autoimmune 

10 disorders). Protein, as well as, antibodies directed against the protein may show utility 
as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:90 and may have been publicly available prior to conception of 

15 the present invention. Preferably, "such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1878 of SEQ ID NO:90, b 

20 is an integer of 15 to 1892, where both a and b correspond to the positions of 

nucleotide residues shown in SEQ ID NO:90, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 81 

25 The translation product of this contig has consistent sequence homology with 

a number of previously described viral tat proteins (see, for example, Stevens, et al., J. 

Virol. 64:3716-3725 (1990), which is hereby incorporated by reference, herein). 
— In-speeifie-embodiments^polypeptides of-the-invention-comprise-the following 

amino acid sequence: QVSGLILSLSCGMDGLALDGSPSPSPXTEKAGRCISQTSL 
30 (SEQ ID NO:355), QVSGLILSLSCGMDGLALDGSPSPSPXTEKAGRCISQTSLP 

GKWEV (SEQ ID NO:356), RASKTVPRMPPNWPAKMPCLCHIRTVEHLGTIS 
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SGAPGRPTGQQAARTYHICWIHPGQKIDSLPPSSQHPRSQQLAPGTWPSTSTT 
KPAEETLGSSASLPISQARKSEKCTFQPSPWXVRGKESHQVPAHPSHRTETES 
D HSPVRKPPSRGTRTGDFTVGDWSEAWLLELALL (SEQ ID NO:357), RMPPN 
WPAKMPCLCHIRTVEHLG (SEQ ID NO:358), GRPTGQQAARTYHICWIHPG 

5 QKIDS (SEQ ID NO:359), WPSTSTTKPAEETLGSSASLPISQA (SEQ ID NO:360), 
KSEKCTFQPSPWXVRGKESHQVP (SEQ ID NO:361), and/or KPPSRGTRTGDF 
TVGDWSEAWLLE (SEQ ID NO:362). Polynucleotides encoding these polypeptides 
are also encompassed by the invention. 

It has been discovered that this gene is expressed almost exclusively in 

10 neutrophils. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of immune disorders. Similarly, polypeptides and antibodies directed to 
- those polypeptides are useful to provide immunological probes for differential 

15 identification of the tissue(s) or cell type(s). For a number of disorders of the immune 
system, expression of this gene at significantly higher or lower levels may be detected 
in certain tissues or cell types (e.g., immune, hematopoietic, and cancerous and 
wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or 
spinal fluid) taken from an individual having such a disorder, relative to the standard 

20 gene expression level, i.e., the expression level in healthy tissue from an individual 

not having the disorder. In addition, molecules of the present invention can be used to 
regulate transcription and translation of genes in cells of the immune system, as well 
as in other cell types. Such transcriptional and translation regulation is useful for 
diagnosing and treating a number of disorders in which an alterred state of 

25 transcription and translation may be a factor in the disorder. Such disorders include 
many viral infections, particularly of immune cells, including HIV-1, HIV-2, human 
T-cell lymphotropic virus (HTLV)-I, and HTLV-II, as well as other DN A and RNA 

vifuses-such as herpes L simplex"vira 

(CMV), Epstein-Barr virus (EBV), herpes samirii, adenoviruses, rhinovinises, 

30 influenza viruses, reoviruses, and the like. In addition, the ability to use molecules of 
the present invention to molecularly regulate the processes of transcription and 
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translation is useful in the diagnosis and treatment of many types of cancers, 
particularly those of the immune system, including ovarian cancer, breast cancer, 
colon cancer, cardiac tumors, pancreatic cancer, melanoma, retinoblastoma, 
glioblastoma, lung cancer, intestinal cancer, testicular cancer, stomach cancer, 
5 neuroblastoma, myxoma, myoma, lymphoma, endothelioma, osteoblastoma, 
osteoclastoma, osteosarcoma, chondrosarcoma, adenoma, and the like. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
188 as residues: Gln-2 to Trp-12, Ala-30 to Glu-35, Gln-42 to Ser-51. 

The tissue distribution in neutrophils, combined with the homology to viral tat 

10 proteins suggests that the protein product of this clone is useful for the diagnosis and 
treatment of immune disorders, particularly viral infections and proliferative 
disorders. Further, since this clone has a high degree of sequence relatedness to 
factors which are involved in the regulation of transcription and translation, this clone 
is useful as a regulator of such processes. Protein, as well as, antibodies directed 

15 against the protein may show utility as a tumor marker and/or immunotherapy targets 
for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:91 and may have been publicly available prior to conception of 

20 the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
afeorT^of more poly nucledtidescomprisinga nucleotide sequencedescribedby the 
general formula of a-b, where a is any integer between 1 to 509 of SEQ ID NO:91, b 

25 is an integer of 15 to 523, where both a and b correspond to the positions of 

nucleotide residues shown in SEQ ID NO:91, and where b is greater than or equal to a 
+ 14. 



FEATURES OF PROTEIN ENCODED BY GENE NO: 82 

30 The translation product of this contig has clear sequence identity with a 

number of thioredoxins and endoplasmic reticulum resident proteins (see, for 
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example, Shorrosh and Dixon, Plant J. 2:51-58 (1992), which is hereby incorporated 
by reference, herein). 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: PCADCLSAWA (SEQ ID NO:363). Polynucleotides encoding 
5 these polypeptides are also encompassed by the invention.The gene encoding the 
disclosed cDNA is believed to reside on chromosome 5. Accordingly, 
polynucleotides related to this invention are useful as a marker in linkage analysis for 
chromosome 5. 

It has been discovered that this gene is expressed primarily in adipocytes and 
10 striatum depression, and in lower abundance in prostate, whole brain, fetal liver, and 
spleen. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: Prostate cancer, CNS diseases, 

15 immune disorders ""Similarly ..polypeptides and antibodies directed to those 

polypeptides are useful to provide immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune, expression of this gene at significantly higher or lower 
levels may be detected in certain tissues or cell types (e.g., neural, hematopoietic, 

20 immune, and cancerous and wounded tissues) or bodily fluids (e.g., seminal fluid, 
amniotic fluid, serum, plasma, urine, synovial fluid or spinal fluid) taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue from an individual not having the disorder. 
Since the translation product of this clone has a high degree of sequence relatedness 

25 to many thioredoxins, it can be used as a food additive to improve flour quality or to 
suppress the anti-nutritional effects of leguminous plants. Molecules of the present 
invention can further used to inactivate toxins, for example, bee or snake venom. 

Preferred epitopes include those comprising a ; sequence~shownin~SEQID NO.~ 
189 as residues: Trp-43 to Ala-49, Pro-68 to Ala-74, Glu-100 to Gly-1 1 1, Glu-120 to 

30 Asn-125, Pro-141 to Ala- 154, Asp-157 to Lys-171, Cys-177 to Ile-182, Ser-248 to 
Leu-253, Thr-280 to Glu-285, GIy-353 to Val-359. 
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The tissue distribution in whole brain suggests that the protein product of this 
clone would be useful for the detection, treatment, and/or prevention of 
neurodegenerative disease states, behavioral disorders, or inflammatory conditions 
which include, but are not limited to Alzheimer's Disease, Parkinson's Disease, 
5 Huntington's Disease, Tourette Syndrome, meningitis, encephalitis, demyelinating 
diseases, peripheral neuropathies, neoplasia, trauma, congenital malformations, spinal 
cord injuries, ischemia and infarction, aneurysms, hemorrhages, schizophrenia, 
mania, dementia, paranoia, obsessive compulsive disorder, depression, panic disorder, 
learning disabilities, ALS, psychoses, autism, and altered behaviors, including 

10 disorders in feeding, sleep patterns, balance, and perception. In addition, elevated 

Expression of this gene product in regions of the brain suggests it plays a role 
in normal neural function. Potentially, this gene product is involved in synapse 
formation, neurotransmission, learning, cognition, homeostasis, or neuronal 
differentiation or survival. The secreted protein can also be used to determine 

15 biological activity," to raise antibodies, as tissue markers, to isolate cognate ligarids or 
receptors, to identify agents that modulate their interactions, and as nutritional 
supplements. It may also have a very wide range of biological activities. Typical of 
these are cytokine, cell proliferation/differentiation modulating activity or induction 
of other cytokines; immunostimulating/immunosuppressant activities (e.g. for treating 

20 human immunodeficiency virus infection, cancer, autoimmune diseases and allergy); 
regulation of hematopoiesis (e.g. for treating anemia or as adjunct to chemotherapy); 
stimulation or growth of bone, cartilage, tendons, ligaments and/or nerves (e.g. for 
treating wounds, stimulation of follicle stimulating hormone (for control of fertility); 
chemotactic and chemokinetic activities (e.g. for treating infections, tumors); 

25 hemostatic or thrombolytic activity (e.g. for treating hemophilia, cardiac infarction 
etc.); anti-inflammatory activity (e.g. for treating septic shock, Crohn's disease); as 
antimicrobials; for treating psoriasis or other hyperproliferative diseases; for 
regulation of metabolism,lind behavior "Also cdmempl^d~islfie~u^of the 
corresponding nucleic acid in gene therapy procedures. Protein, as well as, antibodies 

30 directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 
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Many polynucleotide sequences, such as EST sequences, are publicly 
. available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:92 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
5 excluded from the scope of the present invention. To list every related sequence 

would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1368 of SEQ ID NO:92, b 
is an integer of 15 to 1382, where both a and b correspond to the positions of 
10 nucleotide residues shown in SEQ ID NO:92, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 83 

When tested against TF-1 cell lines, supernatants removed from cells 
15 containing this gerie activated the ISRE (interferon-sensitive responsive element ) 
promoter element. Thus, it is likely that this gene activates myeloid cells, and to a 
lesser extent, in immune and hematopoietic cells or tissues, through the JAK-STAT 
signal transduction pathway. ISRE is a promoter element found upstream in many 
genes which are involved in the Jak-STAT pathway. The Jak-STAT pathway is a 
20 large, signal transduction pathway involved in the differentiation and proliferation of 
cells. Therefore, activation of the Jak-STAT pathway, reflected by the binding of the 
ISRE element, can be used to indicate proteins involved in the proliferation and 
differentiation of cells. 

In specific embodiments, polypeptides of the invention comprise the following 
25 amino acid sequence: HAS G YLCI VLL (SEQ ID NO:364). Polynucleotides encoding 
these polypeptides are also encompassed by the invention. 

It has been discovered that this gene is expressed exclusively in Rejected 
Kidney. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
30 identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: kidney and other urinary tract 



BNSOOCIO:<WO 9947540A1 I > 



WO 99/47540 



PCT/US99/05804 



disorders and disorders related to, or resulting from, transplantation. Similarly, 
polypeptides and antibodies directed to those polypeptides are useful to provide 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the immune and 
5 renal systems, expression of this gene at significantly higher or lower levels may be 
detected in certain tissues or cell types (e.g., renal, kidney, urogenital, immune, 
hematopoietic, and cancerous and wounded tissues) or bodily fluids (e.g., lymph, 
- serum, plasma, urine, synovial fluid or spinal fluid) taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression 

10 level in healthy tissue from an individual not having the disorder. Molecules of the 
present invention are particularly useful in the diagnosis and treatment of disorders 
related to transplantation, particularly kidney transplantation. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
190 as residues: Asn-49 to Gln-54, Glu-150 to Asp-159. 

15 The tissue distribution iri rejected kidney tissue suggests that the protein 

product of this clone would be useful for diagnosis and treatment of disorders related 
to or resulting from rejection of transplanted organs, particularly the kidney. 
Moreover, the protein product of this clone could be used in the treatment and/or 
detection of kidney diseases including renal failure, nephritus, renal tubular acidosis, 

20 proteinuria, pyuria, edema, pyelonephritis, hydronephritis, nephrotic syndrome, crush 
syndrome, glomerulonephritis, hematuria, renal colic and kidney stones, in addition to 
Wilm's Tumor Disease, and congenital kidney abnormalities such as horseshoe 
kidney, polycystic kidney, and Falconi's syndrome. Considering the tissue distribution 
and detected ISRE biological activity, the protein is useful in modulating the immune 

25 response to aberrant kidney proteins, including autoantigens and aberrant proteins 
which are often present in degenerative and proliferative conditions. Protein, as well 
as, antibodies directed against the protein may show utility as a tumor marker and/or 

~ " "~ immunotherapy targets for the above listed tissues: 

Many polynucleotide sequences, such as EST sequences, are publicly 

30 available and accessible through sequence databases. Some of these sequences are 

related to SEQ ID NO:93 and may have been publicly available prior to conception of 
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the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
5 general formula of a-b, where a is any integer between 1 to 1733 of SEQ ID NO:93, b 
is an integer of 15 to 1747, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:93, and where b is greater than or equal to a 
+ 14. 

10 FEATURES OF PROTEIN ENCODED BY GENE NO: 84 

The translation product of this gene shares sequence homology with the 
conserved MAL and plasmolipin protein (Magyar, et al, Gene 189:269-275 (1997); 
See Genbank Accession No.gnllPIDIel 83885), which are thought to be important in 
modulating T cell function, and proper CNS function, respectively. When tested 

15 against Jurkat cell lines, ;supernatants removed from cells containing this gene 

activated the GAS (gamma activating sequence) promoter element. Thus, it is likely 
that this gene activates myeloid cells, and to a lesser extent, immune or hematopoietic 
cells and tissues, through the JAK-STAT signal transduction pathway. GAS is a 
promoter element found upstream of many genes which are involved in the Jak-STAT 

20 pathway. The Jak-STAT pathway is a large, signal transduction pathway involved in 
the differentiation and proliferation of cells. Therefore, activation of the Jak-STAT 
pathway, reflected by the binding of the GAS element, can be used to indicate 
proteins involved in the proliferation and differentiation of cells. 

In specific embodiments, polypeptides of the invention comprise the following 

25 amino acid sequence: NSARAARAEIVLGLLVWTLIAGTEYFRVPAFGWV (SEQ 
ID NO:365). Polynucleotides encoding these polypeptides are also encompassed by 
the invention. 

It has been discovered that this gene is expressed~primaxil>rin T cells. ~ 

Therefore, nucleic acids of the invention are useful as reagents for differential 
30 identification of the tissue(s) or cell type(s) present in a biological sample and for 

diagnosis of immune, hematopoietic, and neural diseases and/or disorders. Similarly, 
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polypeptides and antibodies directed to those polypeptides are useful to provide 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the immune system, 
expression of this gene at significantly higher or lower levels may be detected in 
5 certain tissues or cell types (e.g., immune, hematopoietic, and cancerous and wounded 
tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal 
fluid) taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue from an individual not 
having the disorder. Nucleic acids of the present invention are useful as probes for 

10 detecting traumatic and pathological changes in the central and peripheral nervous 

systems. Molecules of the present invention may be involved in regulating the growth 
of Schwann cells and other neural cells. Molecules of the present invention are also 
useful as modulators of the interaction between Schwann cells and other neural cells 
and the extracellular matrix and is therefore useful for the therapeutic intervention in 

15 nerve damage primarily by facilitating regeneration of damaged axons and 
regenerating nerve cells in damaged nervous system tissues. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
191 as residues: Ser-58 to His-64. 

The tissue distribution in T-cells, combined with the homology to the MAL 

20 and plasmolipin proteins and the detected GAS biological activity suggests that the 
protein product of this clone would be useful for the diagnosis and treatment of 
immune disorders including, but not limited to, AIDS and other immunodeficiencies. 
Morever, the expression of this gene product suggests a role in regulating the 
proliferation; survival; differentiation; and/or activation of hematopoietic cell 

25 lineages, including blood stem cells. This gene product may be involved in the 

regulation of cytokine production, antigen presentation, or other processes suggesting 
a usefulness in the treatment of cancer (e.g. by boosting immune responses). 

_ Since the gene is expressed in cellsof lymphoid "origiri, the^natural gerie 

product may be involved in immune functions. Therefore it may be also used as an 

30 agent for immunological disorders including arthritis, asthma, leukemia, rheumatoid 
arthritis, granulomatous disease, inflammatory bowel disease, sepsis, acne, 
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neutropenia, neutrophilia, psoriasis, hypersensitivities, such as T-cell mediated 
cytotoxicity; immune reactions to transplanted organs and tissues, such as host- 
versus-graft and graft-versus-host diseases, or autoimmunity disorders, such as 
autoimmune infertility, lense tissue injury, demyelination, systemic lupus 
5 erythematosis, drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's 
disease, scleroderma and tissues. Moreover, the protein may represent a secreted 
factor that influences the differentiation or behavior of other blood cells, or that 
recruits hematopoietic cells to sites of injury. In addition, this gene product may have 
commercial utility in the expansion of stem cells and committed progenitors of 
10 various blood lineages, and in the differentiation and/or proliferation of various cell 
types. 

The secreted protein can also be used to determine biological activity, to raise 
antibodies, as tissue markers, to isolate cognate ligands or receptors, to identify agents 
that modulate their interactions, and as nutritional supplements. It may also have a 

15 very wide range orbiological activities". Typical of these are cytokine, cell 

proliferation/differentiation modulating activity or induction of other cytokines; 
immunostimulating/immunosuppressant activities (e.g. for treating human 
immunodeficiency virus infection, cancer, autoimmune diseases and allergy); 
regulation of hematopoiesis (e.g. for treating anemia or as adjunct to chemotherapy); 

20 stimulation or growth of bone, cartilage, tendons, ligaments and/or nerves (e.g. for 
treating wounds, stimulation of follicle stimulating hormone (for control of fertility); 
chemotactic and chemokinetic activities (e.g. for treating infections, tumors); . 
hemostatic or thrombolytic activity (e.g. for treating hemophilia, cardiac infarction 
etc.); anti-inflammatory activity (e.g. for treating septic shock, Crohn's disease); as 

25 antimicrobials; for treating psoriasis or other hyperproliferative diseases; for 
regulation of metabolism, and behavior. Also contemplated is the use of the 
corresponding nucleic acid in gene therapy procedures. Protein, as well as, antibodies 

directed ^g^nst the protein — 

immunotherapy targets for the above listed tissues. 

30 Many polynucleotide sequences, such as EST sequences, are publicly 

available and accessible through sequence databases. Some of these sequences are 



WO 99/47540 



PCT/US99/05804 



158 

related to SEQ ID NO:94 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
5 are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 586 of SEQ ID NO:94, b 
is an integer of 15 to 600, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:94, and where b is greater than or equal to a 
+ 14. 

10 

FEATURES OF PROTEIN ENCODED BY GENE NO: 85 

The translation product of this clone has sequence identity to a protein 
tyrosine kinase reported by Oates and Wilks (The Worm Breeders Gazette 14:87-87 
(1995), which is hereby incorporated by reference herein). The gene encoding the 
15 disclosed cDNA is believed to reside on chromosome 2. Accordingly, 

polynucleotides related to this invention are useful as a marker in linkage analysis for 
chromosome 2. 

It has been discovered that this gene is expressed primarily in cerebellum, 
adult brain, retina, spinal cord, and kidney cortex. 

20 Therefore, nucleic acids of the invention are useful as reagents for differential 

identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: neural, visual, and renal diseases 
and/or disorders. Similarly, poly peptides and antibodies directed to those 
polypeptides are useful to provide immunological probes for differential identification 

25 of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the CNS, retina, and kidney cortex. Expression of this gene at 
significantly higher or lower levels may be detected in certain tissues or cell types 
(e.g., neural; visual; renal; and cancerous~Md wouM 

lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken from an individual 
30 having such a disorder, relative to the standard gene expression level, i.e., the 
expression level in healthy tissue from an individual not having the disorder. 
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The tissue distribution in cerebellum, adult brain, and spinal cord tissue 
suggests that the protein product of this clone would be useful for the diagnosis and 
treatment of neural diseases and disorders. The protein product qf this clone is useful 
for the detection, treatment, and/or prevention of neurodegenerative disease states, 
5 behavioral disorders, or inflammatory conditions which include, but are not limited to 
Alzheimer's Disease, Parkinson's Disease, Huntington's Disease, Tourette Syndrome, 
meningitis, encephalitis, demyelinating diseases, peripheral neuropathies, neoplasia, 
trauma, congenital malformations, spinal cord injuries, ischemia and infarction, 
aneurysms, hemorrhages, schizophrenia, mania, dementia, paranoia, obsessive 
10 compulsive disorder, depression, panic disorder, learning disabilities, ALS, 
psychoses, autism, and altered behaviors, including disorders in feeding, sleep 
patterns, balance, and perception. In addition, elevated 

Expression of this gene product in regions of the brain suggests it plays a role 
in normal neural function. Potentially, this gene product is involved in synapse 

15 formation, neurotransmission, Jearning; cognition, homeostasis, or neuronal 

differentiation or survival. Moreover, the protein product of this clone could be used 
in the treatment and/or detection of kidney diseases including renal failure, nephritus, 
renal tubular acidosis, proteinuria, pyuria, edema, pyelonephritis, hydronephritis, 
nephrotic syndrome, crush syndrome, glomerulonephritis, hematuria, renal colic and 

20 kidney stones, in addition to Wilm's Tumor Disease, and congenital kidney 

abnormalities such as horseshoe kidney, polycystic kidney, and Falconi's syndrome. 
Protein, as well as, antibodies directed against the protein may show utility as a tumor 
marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 

25 available and accessible through sequence databases. Some of these sequences are 

related to SEQ ID NO:95 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 

excluded ff dnTtHe^scdpe of tHe^reseririnvention7 To"lisreWry _ relate^Tequence 

would be cumbersome. Accordingly, preferably excluded from the present invention 
30 are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 572 of SEQ ID NO:95, b 
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is an integer of 15 to 586, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:95, and where b is greater than or equal to a 
+ 14. 

5 FEATURES OF PROTEIN ENCODED BY GENE NO: 86 

The translation product of this clone has homology to trkB, and it is thought 
that the protein of the present invention is a novel novel neural receptor protein- 
tyrosine kinase, a trkB homolog (See for example, ). This protein is likely to be 
derived from a gene for a ligand-regulated receptor closely related to the human trk 

10 oncogene. Northern (RNA) analysis showed that the trkB gene is expressed 

predominantly in the brain and that trkB expresses multiple mRNAs, ranging from 0.7 
to 9 kb. Hybridization of cerebral mRNAs with a variety of probes indicates that there 
are mRNAs encoding truncated trkB receptors. 

In specific embodiments, polypeptides of the invention comprise the sequence 

15 PCSPPDSPPLPG AFVWRVLWVC (SEQ ID NO:366). Polynucleotides encoding this 
polypeptide are also encompassed by the invention. 

It has been discovered that this gene is expressed primarily in breast cancer, 
colon tumor, and B-cell lymphoma. 

Therefore, nucleic acids of the invention are useful as reagents for differential 

20 identification of the tissue(s) or cell type(s) present in a biological sample and for 

diagnosis of the following diseases and conditions: breast cancer, colon tumor, B-cell 
lymphoma. Similarly, polypeptides and antibodies directed to those polypeptides are 
useful to provide immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 

25 the immune, expression of this gene at significantly higher or lower levels may be 
detected in certain tissues or cell types (e.g., neural, gastrointestinal, immune, and 
cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, 
synovial fluid or spinal fluid) taken from an individuarh^vin^^uch^^i^oi^erT 
relative to the standard gene expression level, i.e., the expression level in healthy 

30 tissue from an individual not having the disorder. 
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Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
193 as residues: Ser-29 to Asn-40. 

The tissue distribution in proliferative cells and tissues suggests that the 
protein product of this clone would be useful for the treatment, detection, and/or 
5 prevention of cancer, particularly in the indicated tissues. The expression within 

cellular sources marked by proliferating cells suggests this protein may play a role in 
the regulation of cellular division, and may show utility in the diagnosis and treatment 
of cancer and other proliferative disorders. Similarly, developmental tissues rely on 
decisions involving cell differentiation and/or apoptosis in pattern formation. 
10 Dysregulation of apoptosis can result in inappropriate suppression of cell death, as 
occurs in the development of some cancers, or in failure to control the extent of cell 
death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SMA). Therefore, the 
polynucleotides and polypeptides of the present invention are useful in treating, 

15 detecting, and/or preventing said disorders and conditions, in addition to other types 
of degenerative conditions. Thus this protein may modulate apoptosis or tissue 
differentiation and would be useful in the detection, treatment, and/or prevention of 
degenerative or proliferative conditions and diseases. 

Alternatively, the homology to the trkB protein suggests the protein product of 

20 this clone is useful for the detection, treatment, and/or prevention of 

neurodegenerative disease states, behavioral disorders, or inflammatory conditions 
which include, but are not limited to Alzheimer's Disease, Parkinson's Disease, 
Huntington's Disease, Tourette Syndrome, meningitis, encephalitis, demyelinating 
diseases, peripheral neuropathies, neoplasia, trauma, congenital malformations, spinal 

25 cord injuries, ischemia and infarction, aneurysms, hemorrhages, schizophrenia, 

mania, dementia, paranoia, obsessive compulsive disorder, depression, panic disorder, 
learning disabilities, ALS, psychoses, autism, and altered behaviors, including 
disorders in feeding, sleep patterns, balance, and perceptionTln^ddition, elevated 
expression of this gene product in regions of the brain suggests it plays a role in 

30 normal neural function. Potentially, this gene product is involved in synapse 
formation, neurotransmission, learning, cognition, homeostasis, or neuronal 
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differentiation or survival. Protein, as well as, antibodies directed against the protein 
may show utility as a tumor marker and/or immunotherapy targets for the above listed 
tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
5 available and accessible through sequence databases. Some of these sequences are 

related to SEQ ID NO:96 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
10 are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 788 of SEQ ID NO:96, b 
is an integer of 15 to 802, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:96, and where b is greater than or equal to a 
+ 14. 

15 - . : - ' • . ' ' • ' 

FEATURES OF PROTEIN ENCODED BY GENE NO: 87 

In specific embodiments, polypeptides of the invention comprise the following 

amino acid sequence: ARACFAYNGVCSEGRCWDSHFHGSV (SEQ ID NO:367), 

MSNMGKIPSLSLHIPINKYICSRIPKFIQKVNKSTVLQICLKRQnLNKNKMSDH 
20 SKIGKANLVQIDIHSLGIVETGCVPSKRYCTLLTEQSGFPFLSHP (SEQ ID 

NO:368), 

MAGCCLKLFGVLSLCFLCGLISIERVICNPVSADFQVSTFCQRHCLLR 
SKVMFXIKGXTATIEVINENCTLVAAPPIGFPIXFL (SEQ ID NO:369), MSDHS 
KIGKANLVQIDIHSLGIVETGCVPSKRYCTLLTEQSGFPFLSHP (SEQ ID 

25 NO:370), MAGCCLKLFGVLSLCFLCGLISIERVICNPVSADFQVSTFCQRHCL 
LRSK (SEQ ID NO:371), VMFXIKGXTATIEVINENCTLVAAPPIGFPIXFL (SEQ 
ID NO:372). Polynucleotides encoding these polypeptides are also encompassed by 
the invention. ~ 

It has been discovered that this gene is expressed primarily in dendritic cells, 

30 and smooth muscle. 
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Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: immune, hematopoietic, and 
vascular diseases and/or disorders. Similarly, polypeptides and antibodies directed to 
5 those polypeptides are useful to provide immunological probes for differential 

identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the immune, expression of this gene at significantly 
higher or lower levels may be detected in certain tissues (e.g., immune, 
hematopoietic, . smooth muscle vascular, and cancerous and wounded tissues) or 
10 bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken 
from an individual having such a disorder, relative to the standard gene expression 
level, i.e., the expression level in healthy tissue from an individual not having the 
disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
15 194 as residues: Asp-40 to Ser-52. " 

The tissue distribution in dendritic cells suggests that the protein product of 
this clone would be useful for immune disorders. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
20 related to SEQ ID NO:97 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention . 
are one or more polynucleotides comprising a nucleotide sequence described by the 
25 general formula of a-b, where a is any integer between 1 to 1212 of SEQ ID NO:97, b 
is an integer of 15 to 1226, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO: 97, and where b is greater than or equal to a 
+ 14" 

30 FEATURES OF PROTEIN ENCODED BY GENE NO: 88 



WO 99/47540 



PCT/US99/05804 



164 

The translation product of this gene shares sequence homology with androgen- 
dependant expressed protein from golden hamster hair follicles which is thought to be 
important in regulating the secretions from glands in the skin (See GenBank 
Accession No. gill91315). 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: PTEGRQKVLKTFTVPRSALAMTKTSTCIYHFLVLSWYTF 
LNYYISQEGKDEVKPKILANGARWKY (SEQ ID NO:373), PTEGRQKVLKTF 
TVPRSALAMTKT (SEQ ID NO:375), PRSALAMTKTSTCIYHFLVLSWYTFLN 
YYISQEGK (SEQ ID NO:374), and/or FLN Y YIS QEGKDE VKPKEL ANG AR WK Y 
(SEQ ID NO:376). Polynucleotides encoding these polypeptides are also 
encompassed by the invention. 

It has been discovered that this gene is expressed primarily in lung, colon 
cancer, and testis. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the'tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: disorders of secretory cells 
including cells in the lung, colon, testis and the skin. Similarly, polypeptides and 
antibodies directed to those polypeptides are useful to provide immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the secretory epithelial cells in the lung, 
intestine, testis and skin, expression of this gene at significantly higher or lower levels 
may be detected in certain tissues (e.g., cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
195 as residues: Val-21 to Asp-30, Pro-101 to Thr-109. 

The tissue distribution and homology to androgen regulated protein suggests 
that the protein product of this clone would be useful for treating disorders that 
involve highly secretory cells including those in the colon, testis, and skin. It may be 
useful for diagnosing disorders such as colon, lung, or testicular cancer and may be 
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used to treat pulmonary conditions in patients with compromised respiratory function. 
In addition, the polynucleotides and polypeptides corresponding to this gene are 
useful for the treatment and diagnosis of conditions concerning proper testicular 
function (e.g. endocrine function, sperm maturation), as well as cancer. Therefore, 
this gene product is useful in the treatment of male infertility and/or impotence. This 
gene product is also useful in assays designed to identify binding agents, as such 
agents (antagonists) are useful as male contraceptive agents. 

Similarly, the protein is believed to be useful in the treatment and/or diagnosis 
of testicular cancer. The testes are also a site of active gene expression of transcripts 
that may be expressed, particularly at low levels, in other tissues of the body. 
Therefore, this gene product may be expressed in other specific tissues or organs 
where it may play related functional roles in other processes, such as hematopoiesis, 
inflammation, bone formation, and kidney function, to name a few possible target 
indications. Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker aind/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:98 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1 106 of SEQ ID NO:98, b 
is an integer of 15 to 1 120, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:98, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 89 

The translation product of this gene shares sequence homology with dec-205 a 
transmembrane protein which is thought to be important in antigen presentation in 
dendritic cells and T-cells. 
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It has been discovered that this gene is expressed primarily in macrophage, 
dendritic cells, lung and ulcerative colitis. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
5 diagnosis of the following diseases and conditions: inflammatory diseases such as 
ulcerative colitis. Similarly, polypeptides and antibodies directed to those 
polypeptides are useful to provide immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune system, expression of this gene at significantly higher or 
10 lower levels may be detected in certain tissues (e.g., cancerous and wounded tissues) 
or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) 
taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue from an individual not 
having the disorder. 

15 Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 

196 as residues: Asp-30 to Arg-36, Gln-59 to Val-65. 

The distribution in macrophage, dendritic cells, lung and ulcerative colitis 
tissues, and homology to antigen presenting receptors suggests that the protein 
product of this clone would be useful for modulating the immune response in both 

20 acute and chronic inflammatory conditions. Protein, as well as, antibodies directed 

against the protein may show utility as a tumor marker and/or immunotherapy targets 
for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 

25 related to SEQ ID NO:99 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome.^ccofdihgly : , prefefably^xcluded from^he~^esenrin wntioii 
are one or more polynucleotides comprising a nucleotide sequence described by the 

30 general formula of a-b, where a is any integer between 1 to 2582 of SEQ ID NO:99, b 
is an integer of 1 5 to 2596, where both a and b correspond to the positions of 
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nucleotide residues shown in SEQ ID NO:99, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 90 
5 This gene maps to chromosome 22 and therefore polynucleotides of the 

present invention can be used in linkage analysis as a marker for chromosome 22. 

In specific embodiments, polypeptides of the invention comprise the sequence 
FKDQLVYPLLAFT (SEQ ID NO:377) and/or RQALNLPDVFGLV (SEQ ID 
NO:379). Polnucleotides encoding these polypeptides are also encompassed by the 
10 invention. 

It has been discovered that this gene is expressed primarily in fetal spleen and 
liver as well as cd34 positive cells and to a lesser extent in several tissues suggesting a 
presence in blood or blood forming tissues. 

Therefore, nucleic acids of the invention are useful as reagents for differential 

15 identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: developmental defects in the 
blood and blood forming cells. Similarly, polypeptides and antibodies directed to 
those polypeptides are useful to provide immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 

20 tissues or cells, particularly of the immune system, expression of this gene at 

significantly higher or lower levels may be detected in certain tissues or cell types 
(e.g., fetal spleen and liver as well as cd34 positive cells, cancerous and wounded 
tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal 
fluid) taken from an individual having such a disorder, relative to the standard gene 

25 expression level, i.e., the expression level in healthy tissue from an individual not 
having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 

197 asTesidues: "GlS34To"Gly^l7Asn-79lo 

Gin- 126, Pro- 128 to Phe-134, Arg-150 to Arg-156, Arg-160 to Arg-170. 

30 The tissue distribution in fetal spleen and liver as well as cd34 positive cells 

suggests that the protein product of this clone would be useful for treating disorders in 
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the development, proliferation, or regulation of blood forming cells including diseases 
such as lymphomas, granulomas, leukemias, and in the preservation and or 
replenishment of stem cells in the blood. 

Many polynucleotide sequences, such as EST sequences, are publicly 
5 available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 100 and may have been publicly available prior to conception 
of the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
10 are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1006 of SEQ ID NO: 100, 
b is an integer of 15 to 1020, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO: 100, and where b is greater than or equal to 
a + 14. 

15 - : " ' - , • - . - 

FEATURES OF PROTEIN ENCODED BY GENE NO: 91 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: ATASHDLLLF (SEQ ID NO:379), MSINICLMQSKTQGSCQ 
YLLLPHPVPIILKVSTVFSLLSLFRLLFLSFCPHPKKCSYLLKYYGPLEGHKTLX 

20 YLRTNLGVIQPPLRMYAAEDCNGIG (SEQ ID NO:380), MSINICLMQSKTQG 
SCQYLLLPHPVPIILKVSTVFSLLSLFRLLFL (SEQ ID NO:381), and/or 
SFCPHPK KCSYLLKYYGPLEGHKTLXYLRTNLGVIQPPLRMYAAEDCNGIG 
(SEQ ID NO:382). Polynucleotides encoding these polypeptides are also 
encompassed by the invention. 

25 It has been discovered that this gene is expressed primarily in T cells, fetal 

heart and chronic lymphocytic leukemia and to a lesser extent in kidney, lung, and 16 
week embryos. 

Therefore, nucleic acidsof the^invehtidh~are us^ful^reagents^for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
30 diagnosis of the following diseases and conditions: disorders of the blood including 
abnormalities in T cell function or blood cell proliferation such as leukemia . 
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Similarly, polypeptides and antibodies directed to those polypeptides are useful to 
provide immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
immune system, expression of this gene at significantly higher or lower levels may be 
detected in certain tissues or cell types (e.g., T cells, fetal heart and chronic 
lymphocytic leukemia, cancerous and wounded tissues) or bodily fluids (e.g., lymph, 
serum, plasma, urine, synovial fluid or spinal fluid) taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression 
level in healthy tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
198 as residues: Leu-45 to Val-50. 

The tissue distribution in T cells, fetal heart and chronic lymphocytic leukemia 
suggests that the protein product of this clone would be useful for treating 
abnormalities of the blood particularly those involving T-cells and the abnormal 
proliferation of blood cells such; as lymphocytic leukemia. In addition, it suggests the 
protein product of this clone is useful for the diagnosis and treatment of a variety of 
immune system disorders. Morever, the expression of this gene product suggests a 
role in regulating the proliferation; survival; differentiation; and/or activation of 
hematopoietic cell lineages, including blood stem cells. This gene product may be 
involved in the regulation of cytokine production, antigen presentation, or other 
processes suggesting a usefulness in the treatment of cancer (e.g. by boosting immune 
responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product may be involved in immune functions. Therefore it may be also used as an 
agent for immunological disorders including arthritis, asthma, immunodeficiency 
diseases such as AIDS, leukemia, rheumatoid arthritis, granulomatous disease, 
inflammatory bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, 
hypersensitivitiesT^chTs T-ceU 

transplanted organs and tissues, such as host-versus-graft and graft-versus-host 
diseases, or autoimmunity disorders, such as autoimmune infertility, lense tissue 
injury, demyelination, systemic lupus erythematosis, drug induced hemolytic anemia, 
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rheumatoid arthritis, Sjogren's disease, scleroderma and tissues. Moreover, the protein 
may represent a secreted factor that influences the differentiation or behavior of other 
blood cells, or that recruits hematopoietic cells to sites of injury. In addition, this gene 
product may have commercial utility in the expansion of stem cells and committed 
5 progenitors of various blood lineages, and in the differentiation and/or proliferation of 
various cell types. 

The expression in fetal heart tissue would suggest a useful role for the protein 
product in developmental abnormalities, fetal deficiencies, pre-natal disorders and 
variouswould-healing models and/or tissue trauma. The tissue distribution in kidney 

10 suggests the protein product of this clone could be used in the treatment and/or 

detection of kidney diseases including renal failure, nephritus, renal tubular acidosis, 
proteinuria, pyuria, edema, pyelonephritis, hydronephritis, nephrotic syndrome, crush 
syndrome, glomerulonephritis, hematuria, renal colic and kidney stones, in addition to 
Wilm's Tumor Disease, and congenital kidney abnormalities such as horseshoe 

15 kidney, polycystkrkidney, and Falconi's syndrome. 

In addition, the tissue distribution in embryonic tissue suggests the protein 
product of this clone is useful for the diagnosis, detection, and/or treatment of 
developmental disorders. Expression within embryonic tissue and other cellular 
sources marked by proliferating cells suggests this protein may play a role in the 

20 regulation of cellular division, and may show utility in the diagnosis and treatment of 
cancer and other proliferative disorders. Similarly, developmental tissues rely on 
decisions involving cell differentiation and/or apoptosis in pattern formation. 
Dysregulation of apoptosis can result in inappropriate suppression of cell death, as 
occurs in the development of some cancers, or in failure to control the extent of cell 

25 death, as is believed to occur in acquired immunodeficiency and certain 

neurodegenerative disorders, such as spinal muscular atrophy (SMA). Therefore, the 
polynucleotides and polypeptides of the present invention are useful in treating, 
detecting, and/or preventing said disorders and conditions, in addition to other types 
of degenerative conditions. Thus this protein may modulate apoptosis or tissue 

30 differentiation and would be useful in the detection, treatment, and/or prevention of 
degenerative or proliferative conditions and diseases. Protein, as well as, antibodies 
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directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
5 related to SEQ ID NO: 101 and may have been publicly available prior to conception 
of the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
10 general formula of a-b, where a is any integer between 1 to 1506 of SEQ ID NO: 101, 
b is an integer of 15 to 1520, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO: 101, and where b is greater than or equal to 
a+ 14. 

15 FEATURES OF PROTEIN ENCODED BY GENE NO: 92 

The translation product of this gene shares sequence homology with ctg4 
which is a glutamine repeat containing gene thought to be a candidate genetic disease 
locus. 

In specific embodiments, polypeptides of the invention comprise the sequence 
20 KEEDDDTERLPSKCEVCKLLSTE (SEQ ID NO:383 and 384) LQAELSRTGRSR 
EVLELGQ (SEQ ID NO:385 and 386), RQAVIVCRRRFV (SEQ ID NO:387), 
PPRWAHPKAPEGSPDPPSPPSALGLSVLPWSDSDPWHISVSPCAQREHYSPGS 
AHINSLRPLPALSLKRCKARVSSSCLYPAPAPAPAPLEIDRCDSVPPVALCSAA 
YTLRI C W AS VLCHRPPPSTS QPKPR ARPKKG KAIFPT A Q VP (SEQ ID NO:388), 
25 PPRWAHPKAPEGSPDPPSPPSALGLSVLPWSDSDPWHISVSPCAQREHYSPGS 
AHINSLRPLPALSLKRCK (SEQ ID NO:389), and/or ARVSSSCLYPAPAPAPAPL 
EIDRCDSVPPVALCSAAYTLRICWASVLCHRPPPSTSQPKPRARPKKGKAIFPT 
AQVP (SEQ ID NO:390). Polynucleotides encoding these polypeptides are~aiscT 
encompassed by the invention. 
30 It has been discovered that this gene is expressed in several tissues including 

lung, heart, kidney, adrenal gland, smooth muscle, cerebellum, and embryonic tissue. 
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Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: inherited developmental disorders 
possibly with a neuropsychiatric component. Similarly, polypeptides and antibodies 
5 directed to those polypeptides are useful to provide immunological probes for 

differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the nervous system, expression of this gene 
at significantly higher or lower levels may be detected in certain tissues or cell types 
(e.g., cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, 

10 urine, synovial fluid or spinal fluid) taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy 
tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
199 as residues: Lys-25 to Ser-36, Ser-53 to Glu-60, Thr-70 to Arg-75, Arg-1 1 1 to 

15 Thr-1 19, Glu-16lTo Leu-189. . " 

The tissue distribution and homology to glutamine repeat family member 
CTG4 suggests that the protein product of this clone would be useful for identifying 
and treating specific diseases related to nucleotide triplet expansion. The tissue 
distribution in embryonic tissue suggests the protein product of this clone is useful for 

20 the diagnosis, detection, and/or treatment of developmental disorders. The relatively 
specific expression of this gene product during embryogenesis suggests it may be a 
key player in the proliferation, maintenance, and/or differentiation of various cell 
types during development. It may also act as a morphogen to control cell and tissue 
type specification. Because of potential roles in proliferation and differentiation, this 

25 gene product may have applications in the adult for tissue regeneration and the 

treatment of cancers. Expression within embryonic tissue and other cellular sources 
marked by proliferating cells suggests this protein may play a role in the regulation of 
cellular division, and may show utility in the diagnosis and treatment of cancer and " 
other proliferative disorders. 

30 Many polynucleotide sequences, such as EST sequences, are publicly 

available and accessible through sequence databases. Some of these sequences are 
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related to SEQ ID NO: 102 and may have been publicly available prior to conception 
of the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
5 are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1292 of SEQ ID NO: 102, 
b is an integer of 15 to 1306, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO: 102, and where b is greater than or equal to 
a + 14, 

10 

FEATURES OF PROTEIN ENCODED BY GENE NO: 93 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: EEKLFTSAPGRDFWVMGETRDGNEEN (SEQ ID NO:391). 
Polynucleotides encoding these polypeptides are also encompassed by the invention. 
15 The gene encoding' the disclosed cDNA is believed to reside on chromosome 16. 
Accordingly, polynucleotides related to this invention are useful as a marker in 
linkage analysis for chromosome 16. 

It has been discovered that this gene is expressed primarily in cancerous and 
fetal tissue. 

20 Therefore, nucleic acids of the invention are useful as reagents for differential 

identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: cancer, developmental anomalies 
or fetal deficiencies. Similarly, polypeptides and antibodies directed io those / 
polypeptides are useful to provide immunological probes for differential identification 

25 of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the reproductive system and developing fetus, expression of this gene 
at significantly higher or lower levels may be detected in certain tissues or cell types 
(e.g., developmental, reproductive, and cancerous and wounded Tissues) or bodily* 
fluids (e.g., lymph, amniotic fluid, serum, plasma, urine, synovial fluid or spinal fluid) 

30 taken from an individual having such a disorder, relative to the standard gene 
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expression level, i.e., the expression level in healthy tissue from an individual not 
having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
200 as residues: Met-1 to Ser-6. 

The tissue distribution in fetal tissue suggests that the protein product of this 
clone would be useful for the treatment and diagnosis of developmental anomalies or 
fetal deficiencies. In addition to fetal tissue, expression in a variety of cancerous 
tissues suggests a role in the treatment and diagnosis of uncontrolled cell proliferation 
and/or differentiation (e.g. cancer). Moreover, the expression within embryonic tissue 
and other cellular sources marked by proliferating cells suggests this protein may play 
a role in the regulation of cellular division, and may show utility in the diagnosis and 
treatment of cancer and other proliferative disorders. 

Similarly, developmental tissues rely on decisions involving cell 
differentiation and/or apoptosis in pattern formation. Dysregulation of apoptosis can 
result in inappropriate, suppression of celldeath, as occurs in the development of some 
cancers, or in failure to control the extent of cell death, as is believed to occur in 
acquired immunodeficiency and certain neurodegenerative disorders, such as spinal 
muscular atrophy (SMA). Therefore, the polynucleotides and polypeptides of the 
present invention are useful in treating, detecting, and/or preventing said disorders 
and conditions, in addition to other types of degenerative conditions. Thus this protein 
may modulate apoptosis or tissue differentiation and would be useful in the detection, 
treatment, and/or prevention of degenerative or proliferative conditions and diseases. 
Protein, as well as, antibodies directed against the protein may show utility as a tumor 
marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 103 and may have been publicly available prior to conception 
of the present invention. Preferably, such related polynucleotides^are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
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general formula of a-b, where a is any integer between 1 to 771 of SEQ ID NO: 103, b 
is an integer of 15 to 785, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO: 103, and where b is greater than or equal to 
a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 94 

The gene encoding the disclosed cDNA is believed to reside on chromosome 
10. Accordingly, polynucleotides related to this invention are useful as a marker in 
linkage analysis for chromosome 10. 

This gene is expressed primarily in hypothalamus, T-cells, and adipose tissue. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: immune (e.g. immunodeficiencies, 
autoimmunities, inflammation, leukemias & lymphomas) and neurological (e.g. 
Alzheimer's disease, dementia, schizophrenia) disorders. Similarly, polypeptides and 
antibodies directed to those polypeptides are useful to provide immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the central nervous, hematopoietic and 
immune systems, expression of this gene at significantly higher or lower levels may 
be detected in certain tissues (e.g., immune, neural, metabolic, and cancerous and 
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 
fluid) taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue from an individual not 
having the disorder. The tissue distribution suggests that the protein product of this 
clone would be useful in the intervention or detection of pathologies associated with 
the hematopoietic and immune systems, such as anemias (leukemias). In addition, the 
expression in brain (including fetal) might suggest a role in developmental brain 
defects, neurodegenerative diseases or beha^orafabnom^^ 
Alzheimer's, dementia, depression, etc.). 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
201 as residues: Phe-64 to Gly-77, Pro-83 to Asp-99. 
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The tissue distribution in hypothalamus suggests the protein product of this 
clone is useful for the detection, treatment, and/or prevention of neurodegenerative 
disease states, behavioral disorders, or inflammatory conditions which include, but 
are not limited to Alzheimer's Disease, Parkinson's Disease, Huntington's Disease, 
5 Tourette Syndrome, meningitis, encephalitis, demyelinating diseases, peripheral 
neuropathies, neoplasia, trauma, congenital malformations, spinal cord injuries, 
ischemia and infarction, aneurysms, hemorrhages, schizophrenia, mania, dementia, 
paranoia, obsessive compulsive disorder, depression, panic disorder, learning 
disabilities, ALS, psychoses, autism, and altered behaviors, including disorders in 
10 feeding, sleep patterns, balance, and perception. In addition, elevated expression of 
this gene product in regions of the brain suggests it plays a role in normal neural 
function. Potentially, this gene product is involved in synapse formation, 
neurotransmission, learning, cognition, homeostasis, or neuronal differentiation or 
survival. This gene product may be involved in the regulation of cytokine production, 

1 5 antigen presentation, or other, processes suggesting a usefulness in the treatment of 
cancer (e.g. by boosting immune responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product may be involved in immune functions. Therefore it may be also used as an 
agent for immunological disorders including arthritis, asthma, immunodeficiency 

20 diseases such as AIDS, leukemia, rheumatoid arthritis, granulomatous disease, 
inflammatory bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, 
hypersensitivities, such as T-cell mediated cytotoxicity; immune reactions to 
transplanted organs and tissues, such as host-versus-graft and graft-versus-host 
diseases, or autoimmunity disorders, such as autoimmune infertility, lense tissue 

25 injury, demyelination, systemic lupus erythematosis, drug induced hemolytic anemia, 
rheumatoid arthritis, Sjogren's disease, scleroderma and tissues. 

Moreover, the protein may represent a secreted factor that influences the 

differentiation or behavior of other blood cells, or th^frecruit^ hematopoietic cells to - 
sites of injury. In addition, this gene product may have commercial utility in the 

30 expansion of stem cells and committed progenitors of various blood lineages, and in 
the differentiation and/or proliferation of various cell types. Moreover, the protein 
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product of this clone is useful for the diagnosis, prevention, and/or treatment of 
various metabolic disorders which include, but are not limited to, Tay-Sachs disease, 
phenylkenonuria, galactosemia, hyperlipidemias, porphyrias, and Hurler's syndrome. 
The protein is useful in the treatment and/or prevention of neurodegenerative 
5 conditions, particularly those which occur secondary to aberrant fatty acid 

metabolism (i.e. defects which affect.the synthesis and integrity of the myelin sheath). 
Protein, as well as, antibodies directed against the protein may show utility as a tumor 
marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 

10 available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 104 and may have been publicly available prior to conception 
of the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 

15 are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 2001 of SEQ ID NO: 104, 
b is an integer of 15 to 2015, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO: 104, and where b is greater than or equal to 
a + 14. 

20 

FEATURES OF PROTEIN ENCODED BY GENE NO: 95 

The translation product of this gene was shown to have homology to the 

murine leucine-rich repeat protein (See Genbank Accession No. gil2880079), which is 

thought to be important in neural development. 
25 In specific embodiments, the polypeptides of the invention comprise the 

sequence:QKPTFALGELYPPLINLWEAGKEKSTSLKVKATVIGLPTNMS (SEQ 
ID_NQ:392)^ Polynucleotides encoding this polypeptide are also encompassed by the 

invention. The gene encoding the disclosed cDNA is believed to resid^oiT ~ 

chromosome 7. Accordingly, polynucleotides related to this invention are useful as 
30 a marker in linkage analysis for chromosome 7. 
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It has been discovered that this gene is expressed primarily in T-cells and 

brain. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
5 diagnosis of the following diseases and conditions: immunodeficiency, tumor 
necrosis, infection, lymphomas, auto-immunities, cancer, inflammation, anemias 
(leukemia) and other hematopoeitic disorders, neurological diseases of the brain such 
as depression, schizophrenia, Alzheimer's disease, Parkinson's disease, Huntington's 
disease, dementia and specific brain tumors. Similarly, polypeptides and antibodies 

10 directed to those polypeptides are useful to provide immunological probes for 

differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the brain and immune system, expression of 
this gene at significantly higher or lower levels may be detected in certain tissues or 
cell types (e.g., neural, immune, hematopoietic, and cancerous and wounded tissues) 

15 or bodily fluids (e.g., lymph,- amniotic fluid, serum, plasma, urine, synovial' fluid or 
spinal fluid) taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue from an individual 
not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 

20 202 as residues: Met-24 to Gly-29, Ala-57 to Thr-63. 

The tissue distribution in T-cells suggests that the protein product of this clone 
would be useful for the diagnosis and treatment of immune disorders including: 
leukemias, lymphomas, auto-immunities, immunodeficiencies (e.g. AIDS), immuno- 
supressive conditions (transplantation) and hematopoeitic disorders. In addition this 

25 gene product may be applicable in conditions of general microbial infection, 

inflammation or cancer. The expression in brain, combined with the homology to the 

leucine-rich^repe^protein^suggests that the protein product of this clone would be 

useful for the treatment and diagnosis of developmental, degenerative and behavioral "~ 
conditions of the brain and nervous system, such as depression, schizophrenia, 

30 Alzheimer's disease, Parkinson's disease, Huntington's disease, Tourette Syndrome, 
mania, dementia, paranoia, addictive behavior, obsessive-compulsisve disorder and 
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sleep disorders. Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. - 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO.105 and may have been publicly available prior to conception 
of the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by.the 
general formula of a-b, where a is any integer between 1 to 353 of SEQ ID NO: 105, b 

is an integer of 15 to 367, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:105, and where b is greater than or equal to 
a+14. 
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Table 1 summarizes the information corresponding to each "Gene No." described 
above. The nucleotide sequence identified as "NT SEQ ID NO:X" was assembled 
from partially homologous ("overlapping") sequences obtained from the "cDNA 
clone ID" identified in Table 1 and, in some cases, from additional related DNA 
5 clones. The overlapping sequences were assembled into a single contiguous sequence 
of high redundancy (usually three to five overlapping sequences at each nucleotide 
position), resulting in a final sequence identified as SEQ ID NO:X. 

The cDNA Clone ID was deposited on the date and given the corresponding 
deposit number: listed in "ATCC Deposit No:Z and Date." Some of the deposits 
10 contain multiple different clones corresponding to the same; gene. "Vector" refers to 
the type of vector contained in the cDNA Clone ID. 

"Total NT Seq." refers to the total number of nucleotides in the contig 
identified by "Gene No." The deposited clone may contain all or most of these 
sequences, reflected by the nucleotide position indicated as "5' NT of Clone Seq." 
15 and the "3* NT of Clone Seq." of SEQ ID NO:X. The nucleotide position of SEQ ID 
NO:X of the putative start codon (methionine) is identified as "5* NT of Start Codon." 
Similarly , the nucleotide position of SEQ ID NO:X of the predicted signal sequence 
is identified as "5' NT of First AA of Signal Pep." 

The translated amino acid sequence, beginning with the methionine, is 
20 identified as "AA SEQ ID NO: Y," although other reading frames can also be easily 
translated using known molecular biology techniques. The polypeptides produced by 
these alternative open reading frames are specifically contemplated by the present 
invention. 

The first and last amino acid position of SEQ ID NO: Y of the predicted signal 
25 peptide is identified as "First AA of Sig Pep" and "Last AA of Sig Pep." The 
predicted first amino acid position of SEQ ID NO:Y of the secreted portion is 
identified as "Predicted First AA of Secreted Portion." Finally, the amino acid 

posinonof SEQ_IDNO:Y-of the last-amino aeid-in the open reading frame is 

identified as "Last AA of ORF." 
30 SEQ ID NO:X and the translated SEQ ID NO: Y are sufficiently accurate and 

otherwise suitable for a variety of uses well known in the art and described further 
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below. For instance, SEQ ID NO:X is useful for designing nucleic acid hybridization 
probes that will detect nucleic acid sequences contained in SEQ ID NO:X or the 
cDNA contained in the deposited clone. These probes will also hybridize to nucleic 
acid molecules in biological samples, thereby enabling a variety of forensic and 
diagnostic methods of the invention. Similarly, polypeptides identified from SEQ ID 
NO: Y may be used to generate antibodies which bind specifically to the secreted 
proteins encoded by the cDNA clones identified in Table 1. 

Nevertheless, DNA sequences generated by sequencing reactions can contain 
sequencing errors. The errors exist as misidentified nucleotides, or as insertions or 
deletions of nucleotides in the generated DNA sequence. The erroneously inserted or 
deleted nucleotides cause frame shifts in the reading frames of the predicted amino 
acid sequence. In these cases, the predicted amino acid sequence diverges from the 
actual amino acid sequence, even though the generated DNA sequence may be greater 
than 99.9% identical to the actual DNA sequence (for example, one base insertion or 
deletion in an openreading frame. of over 1000 bases). 

Accordingly, for those applications requiring precision in the nucleotide 
sequence or the amino acid sequence, the present invention provides not only the 
generated nucleotide sequence identified as SEQ ID NO:X and the predicted 
translated amino acid sequence identified as SEQ ID NO:Y, but also a sample of 
plasmid DNA containing a human cDNA of the invention deposited with the ATCC, 
as set forth in Table 1 . The nucleotide sequence of each deposited clone can readily 
be determined by sequencing the deposited clone in accordance with known methods. 
The predicted amino acid sequence can then be verified from such deposits. 
Moreover, the amino acid sequence of the protein encoded by a particular clone can 
also be directly determined by peptide sequencing or by expressing the protein in a 
suitable host cell containing the deposited human cDNA, collecting the protein, and 
determining its sequence. 

The present invention also-relates to the-genes corresponding to SEQ ID 

NO:X, SEQ ID NO: Y, or the deposited clone. The corresponding gene can be 
isolated in accordance with known methods using the sequence information disclosed 
herein. Such methods include preparing probes or primers from the disclosed 
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sequence and identifying or amplifying the corresponding gene from appropriate 
sources of genomic material. 

Also provided in the present invention are species homologs. Species 
homologs may be isolated and identified by making suitable probes or primers from 
the sequences provided herein and screening a suitable nucleic acid source for the 
desired homologue. 

The polypeptides of the invention can be prepared in any suitable manner. 
Such polypeptides include isolated naturally occurring polypeptides, recombinantly 
produced polypeptides, synthetically produced polypeptides, or polypeptides 
produced by a combination of these methods. Means for preparing such polypeptides 
are well understood in the art. 

The polypeptides may be in the form of the secreted protein, including the 
mature form, or may be a part of a larger protein, such as a fusion protein (see below). 
It is often advantageous to include an additional amino acid sequence which contains 
secretory or leader sequences, pro-sequences, sequences which aid in purification , 
such as multiple histidine residues, or an additional sequence for stability during 
recombinant production. 

The polypeptides of the present invention are preferably provided in an 
isolated form, and preferably are substantially purified. A recombinantly produced 
version of a polypeptide, including the secreted polypeptide, can be substantially 
purified by the one-step method described in Smith and Johnson, Gene 67:31-40 
(1988). Polypeptides of the invention also can be purified from natural or 
recombinant sources using antibodies of the invention raised against the secreted 
protein in methods which are well known in the art. 

Signal Sequences 

Methods for predicting, whether a protein has a signal sequence, as well as the 

_ _cleayage„point.for that sequence,-are available^ For instancerthe method of 

McGeoch, Virus Res. 3:271-286 (1985), uses the information from a short N-terminal 
charged region and a subsequent uncharged region of the complete (uncleaved) 
protein. The method of von Heinje, Nucleic Acids Res. 14:4683-4690 (1986) uses the 
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information from the residues surrounding the cleavage site, typically residues -13 to 
+2, where +1 indicates the amino terminus of the secreted protein. The accuracy of 
predicting the cleavage points of known mammalian secretory proteins for each of 
these methods is in the range of 75-80%. (von Heinje, supra.) However, the two 
5 methods do not always produce the same predicted cleavage point(s) for a given 
protein. 

In the present case, the deduced amino acid sequence of the secreted 
polypeptide was analyzed by a computer program called SignalP (Henrik Nielsen et 
al., Protein Engineering 10:1-6 (1997)), which predicts the cellular location of a 

10 protein based on the amino acid sequence. As part of this computational prediction of 
localization, the methods of McGeoch and von Heinje are incorporated. The analysis 
of the amino acid sequences of the secreted proteins described herein by this program 
provided the results shown in Table 1. . 

As one of ordinary skill would appreciate, however, cleavage sites sometimes 

15 vary from organism to prganism and cannot be predicted with absolute certainty. 

Accordingly, the present invention provides secreted polypeptides having a sequence 
shown in SEQ ID NO: Y which have an N-terminus beginning within 5 residues (i.e., 
+ or - 5 residues) of the predicted cleavage point. Similarly, it is also recognized that 
in some cases, cleavage of the signal sequence from a secreted protein is not entirely 

20 uniform, resulting in more than one secreted species. These polypeptides, and the 
polynucleotides encoding such polypeptides, are contemplated by the present 
invention. 

. Moreover, the signal sequence identified by the above analysis may not 
necessarily predict the naturally occurring signal sequence. For example, the 

25 naturally occurring signal sequence may be further upstream from the predicted signal 
sequence. However, it is likely that the predicted signal sequence will be capable of 
directing the secreted protein to the ER. These polypeptides, and the polynucleotides 

encoding such polypeptides, are contemplated by the presenrinventionT 

30 Polynucleotide and Polypeptide Variants 
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"Variant" refers to a polynucleotide or polypeptide differing from the 
polynucleotide or polypeptide of the present invention, but retaining essential 
properties thereof. Generally, variants are overall closely similar, and, in many 
regions, identical to the polynucleotide or polypeptide of the present invention. 

By a polynucleotide having a nucleotide sequence at least, for example, 95% 
"identical" to a reference nucleotide sequence of the present invention, it is intended 
that the nucleotide sequence of the polynucleotide is identical to the reference 
sequence except that the polynucleotide sequence may include up to five point 
mutations per each 100 nucleotides of the reference nucleotide sequence encoding the 
polypeptide. In other words, to obtain a polynucleotide having a nucleotide sequence 
at least 95% identical to a reference nucleotide sequence, up to 5% of the nucleotides 
in the reference sequence may be deleted or substituted with another nucleotide, or a 
number of nucleotides up to 5% of the total nucleotides in the reference sequence may 
be inserted into the reference sequence. The query sequence may be an entire 
sequence shown inTable 1, the ORF (open reading frame), or any fragement specified 
as described herein. 

As a practical matter, whether any particular nucleic acid molecule or 
polypeptide is at least 90%, 95%, 96%, 97%, 98% or 99% identical to a nucleotide 
sequence of the presence invention can be determined conventionally using known 
computer programs. A preferred method for de terming the best overall match 
between a query sequence (a sequence of the present invention) and a subject 
sequence, also referred to as a global sequence alignment, can be determined using 
the FASTDB computer program based on-the algorithm of Brutlag et al. (Comp. App. 
Biosci. (1990) 6:237-245). In a sequence alignment the query and subject sequences 
are both DNA sequences. An RNA sequence can be compared by converting LPs to 
T's. The result of said global sequence alignment is in percent identity. Preferred 
parameters used in a FASTDB alignment of DNA sequences to calculate percent 

_identiy_are:_ Matrix=Unitary, k-tuple=4, Mismatch Penalty=l —Joining Penal ty=30, 

Randomization Group Length=0, Cutoff Score=l, Gap Penalty =5, Gap Size Penalty 
0.05, Window Size=500 or the lenght of the subject nucleotide sequence, whichever is 
shorter. 
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If the subject sequence is shorter than the query sequence because of 5* or 3' 
deletions, not because of internal deletions, a manual correction must be made to the 
results. This is because the FASTDB program does not account for 5' and 3' 
truncations of the subject sequence when calculating percent identity. For subject 
sequences truncated at the 5' or 3' ends, relative to the the query, sequence, the 
percent identity is corrected by calculating the number of bases of the query sequence 
that are 5' and 3' of the subject sequence, which are not matched/aligned, as a percent 
of the total bases of the query sequence. Whether a nucleotide is matched/aligned is 
determined by results of the FASTDB sequence alignment. This percentage is then 
subtracted from the percent identity, calculated by the above FASTDB program using 
the specified parameters, to arrive at a final percent identity score. This corrected 
score is what is used for the purposes of the present invention. Only bases outside the 
5' and 3' bases of the subject sequence, as displayed by the FASTDB alignment, 
which are not matched/aligned with the query sequence, are calculated for the 
purposes of manually adjusting the percent identity score. - •■ . 

For example, a 90 base subject sequence is aligned to a 100 base query 
sequence to determine percent identity. The deletions occur at the 5' end of the 
subject sequence and therefore, the FASTDB alignment does not show a 
matched/alignement of the first 10 bases at 5' end. The 10 unpaired bases represent 
10% of the sequence (number of bases at the 5' and 3' ends not matched/total number 
of bases in the query sequence) so 10% is subtracted from the percent identity score 
calculated by the FASTDB program. If the remaining 90 bases were perfectly 
matched the final percent identity would be 90%. In another example, a 90 base 
subject sequence is compared with a 100 base query sequence. This time the 
deletions are internal deletions so that there are no bases on the 5' or 3' of the subject 
sequence which are not matched/aligned with the query. In this case the percent 
identity calculated by FASTDB is not manually corrected. Once again, only bases 5' 
_and 3'_of the-subject sequence-whieh-are notanatehed/aligned with the query sequnce - 
are manually corrected for. No other manual corrections are to made for the purposes 
of the present invention. 

By a polypeptide having an amino acid sequence at least, for example, 95% 
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"identical" to a query amino acid sequence of the present invention, it is intended that 
the amino acid sequence of the subject polypeptide is identical to the query sequence 
except that the subject polypeptide sequence may include up to five amino acid 
alterations per each 100 amino acids of the query amino acid sequence. In other 
5 words, to obtain a polypeptide having an amino acid sequence at least 95% identical 
to a query amino acid sequence, up to 5% of the amino acid residues in the subject 
sequence may be inserted, deleted, (indels) or substituted with another amino acid. 
These alterations of the reference sequence may occur at the amino or carboxy 
terminal positions of the reference amino acid sequence or anywhere between those 
10 terminal positions, interspersed either individually among residues in the reference 
sequence or in one or more contiguous groups within the reference sequence. 

As a practical matter, whether any particular polypeptide is at least 90%, 95%, 
96%, 97%, 98% or 99% identical to, for instance, the amino acid sequences shown in 
Table 1 or to the amino acid sequence encoded by deposited DNA clone can be 
1 5 determined conventionally using known computer programs. A preferred method for 
determing the best overall match between a query sequence (a sequence of the present 
invention) and a subject sequence, also referred to as a global sequence alignment, 
can be determined using the FASTDB computer program based on the algorithm of 
Brutlag et al. (Comp. App. Biosci. (1990) 6:237-245). In a sequence alignment the 

20 query and subject sequences are either both nucleotide sequences or both amino acid 
sequences. The result of said global sequence alignment is in percent identity. 
Preferred parameters used in a FASTDB amino acid alignment are: Matrix=PAM 0, 
k-tuple=2, Mismatch Penalty=l, Joining Penalty=20, Randomization Group 
Length=0, Cutoff Score=l, Window Size=sequence length, Gap Penalty=5, Gap Size 

25 Penalty =0.05, Window Size=500 or the length of the subject amino acid sequence, 
whichever is shorter. 

If the subject sequence is shorter than the query sequence due to N- or C- 

terminaLdeletions, notbecause of internal-deletions,~a-manual correction must-be - 

made to the results. This is becuase the FASTDB program does not account for N- 

30 and C-terminal truncations of the subject sequence when calculating global percent 
identity. For subject sequences truncated at the N- and C-termini, relative to the the 
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query sequence, the percent identity is corrected by calculating the number of residues 
of the query sequence that are N- and C-terminal of the subject sequence, which are 
not matched/aligned with a corresponding subject residue, as a percent of the total 
bases of the query sequence. Whether a residue is matched/aligned is determined by 
results of the FASTDB sequence alignment. This percentage is then subtracted from 
the percent identity, calculated by the above FASTDB program using the specified 
parameters, to arrive at a final percent identity score. This final percent identity score 
is what is used for the purposes of the present invention. Only residues to the N- and 
C-termini of the subject sequence, which are not matched/aligned with the query 
sequence, are considered for the purposes of manually adjusting the percent identity 
score. That is, only query residue positions outside the farthest N- and C-terminal 
residues of the subject sequence. 

For example, a 90 amino acid residue subject sequence is aligned with a 100 
residue query sequence to determine percent identity. The deletion occurs at the N- 
terminus of the subject sequence and therefore, the FASTDB alignment does not 
show a matching/alignment of the first 10 residues at the N-terminus. The 10 
unpaired residues represent 10% of the sequence (number of residues at the N- and C- 
termini not matched/total number of residues in the query sequence) so 10% is 
subtracted from the percent identity score calculated by the FASTDB program. If the 
remaining 90 residues were perfectly matched the final percent identity would be 
90%. In another example, a 90 residue subject sequence is compared with a 100 
residue query sequence. This time the deletions are internal deletions so there are no 
residues at the N- or C-termini of the subject sequence which are not matched/aligned 
with the query. In this case the percent identity calculated by FASTDB is not 
manually corrected. Once again, only residue positions outside the N- and C-terminal 
ends of the subject sequence, as displayed in the FASTDB alignment, which are not 
matched/aligned with the query sequnce are manually corrected for. No other manual 

.corrections are to-made for the-purposes of the present invention. — ~ 

The variants may contain alterations in the coding regions, non-coding 
regions, or both. Especially preferred are polynucleotide variants containing 
alterations which produce silent substitutions, additions, or deletions, but do not alter 
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the properties or activities of the encoded polypeptide. Nucleotide variants produced 
by silent substitutions due to the degeneracy of the genetic code are preferred. 
Moreover, variants in which 5-10, 1-5, or 1-2 amino acids are substituted, deleted, or 
added in any combination are also preferred. Polynucleotide variants can be produced 
5 for a variety of reasons, e.g., to optimize codon expression for a particular host 

(change codons in the human mRNA to those preferred by a bacterial host such as E. 
coli). 

Naturally occurring variants are called "allelic variants," and refer to one of 
several alternate forms of a gene occupying a given locus on a chromosome of an 

10 organism. (Genes II, Lewin, B., ed., John Wiley & Sons, New York (1985).) These 
allelic variants can vary at either the polynucleotide and/or polypeptide level. 
Alternatively, non-naturally occurring variants may be produced by mutagenesis 
techniques or by direct synthesis. 

Using known methods of protein engineering and recombinant DNA 

15 technology, variants may be generated- to improve or alter the characteristics of the 
polypeptides of the present invention. For instance, one or more amino acids can be 
deleted from the N-terminus or C-terminus of the secreted protein without substantial 
loss of biological function. The authors of Ron et al., J. Biol. Chem. 268: 2984-2988 
(1993), reported variant KGF proteins having heparin binding activity even after 

20 deleting 3, 8, or 27 amino-terminal amino acid residues. Similarly, Interferon gamma 
exhibited up to ten times higher activity after deleting 8-10 amino acid residues from 
the carboxy terminus of this protein. (Dobeli et al., J. Biotechnology 7: 199-216 
(1988).) 

Moreover, ample evidence demonstrates that variants often retain a biological 
25 activity similar to that of the naturally occurring protein. For example, Gayle and 
coworkers (J. Biol. Chem 268:22105-221 1 1 (1993)) conducted extensive mutational 
analysis of human cytokine IL-la. They used random mutagenesis to generate over 

3^00Jiidividual IL-l_ajnutantsjft^ - — 

the entire length of the molecule. Multiple mutations were examined at every 
30 possible amino acid position. The investigators found that "[m]ost of the molecule 
could be altered with little effect on either [binding or biological activity]." (See, 
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Abstract.) In fact, only 23 unique amino acid sequences, out of more than 3,500 
nucleotide sequences examined, produced a protein that significantly differed in 
activity from wild-type. 

Furthermore, even if deleting one or more amino acids from the N-terminus or 
C-terminus of a polypeptide results in modification or loss of one or more biological 
functions, other biological activities may still be retained. For example, the ability of 
a deletion variant to induce and/or to bind antibodies which recagnize the secreted 
form will likely be retained when less than the majority of the residues of the secreted 
form are removed from the N-terminus or C-terminus. Whether a particular 
polypeptide lacking N- or C-terminal residues of a protein retains such immunogenic 
activities can readily be determined by routine methods described herein and 
otherwise known in the art. 

Thus, the invention further includes polypeptide variants which show 
substantial biological activity. Such variants include deletions, insertions, 
inversions, repeats,-and substitutions selected according to general rules known in the 
art so as have little effect on activity. For example, guidance concerning how to make 
phenotypically silent amino acid substitutions is provided in Bowie, J. U. et al., 
Science 247:1306-1310 (1990), wherein the authors indicate that there are two main 
strategies for studying the tolerance of an amino acid sequence to change. 

The first strategy exploits the tolerance of amino acid substitutions by natural 
selection during the process of evolution. By comparing amino acid sequences in 
different species, conserved amino acids can be identified. These conserved amino 
acids are likely important for protein function. In contrast, the amino acid positions 
where substitutions have been tolerated by natural selection indicates that these 
positions are not critical for protein function. Thus, positions tolerating amino acid 
substitution could be modified while still maintaining biological activity of the 
protein. 

The^second.strategyLuses genetic-engineering to introduceaminoacid changes — 

at specific positions of a cloned gene to identify regions critical for protein function. 
For example, site directed mutagenesis or alanine-scanning mutagenesis (introduction 
of single alanine mutations at every residue in the molecule) can be used. 
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(Cunningham and Wells, Science 244:1081-1085 (1989).) The resulting mutant 
molecules can then be tested for biological activity. 

As the authors state, these two strategies have revealed that proteins are 
surprisingly tolerant of amino acid substitutions. The authors further indicate which 
5 amino acid changes are likely to be permissive at certain amino acid positions in the 
protein. For example, most buried (within the tertiary structure of the protein) amino 
acid residues require nonpolar side chains, whereas few features of surface side chains 
are generally conserved. Moreover, tolerated conservative amino acid substitutions 
involve replacement of the aliphatic or hydrophobic amino acids Ala, Val, Leu and 
10 He; replacement of the hydroxyl residues Ser and Thr; replacement of the acidic 

residues Asp and Glu; replacement of the amide residues Asn and Gin, replacement of 
the basic residues Lys, Arg, and His; replacement of the aromatic residues Phe, Tyr, 
and Trp, and replacement of the small-sized amino acids Ala, Ser, Thr, Met, and Gly. 

15 Besides conservative amino acid substitution, variants of the present invention 

include (i) substitutions with one or more of the non-conserved amino acid residues, 
where the substituted amino acid residues may or may not be one encoded by the 
genetic code, or (ii) substitution with one or more of amino acid residues having a 
substituent group, or (iii) fusion of the mature polypeptide with another compound, 

20 such as a compound to increase the stability and/or solubility of the polypeptide (for 
example, polyethylene glycol), or (iv) fusion of the polypeptide with additional amino 
acids, such as an IgG Fc fusion region peptide, or leader or secretory sequence, or a 
sequence facilitating purification. Such variant polypeptides are deemed to be within 
the scope of those skilled in the art from the teachings herein. 

25 For example, polypeptide variants containing amino acid substitutions of 

charged amino acids with other charged or neutral amino acids may produce proteins 
with improved characteristics, such as less aggregation. Aggregation of 

pharmaceutical-formulations both reduces activity and increases cleararic^due to the 

aggregate's immunogenic activity. (Pinckard et al., Clin. Exp. Immunol. 2:331-340 

30 (1967); Robbins et al., Diabetes 36: 838-845 (1987); Cleland et al., Crit. Rev. 
Therapeutic Drug Carrier Systems 10:307-377 (1993).) 
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A further embodiment of the invention relates to a polypeptide which 
comprises the amino acid sequence of the present invention having an amino acid 
sequence which contains at least one amino acid substitution, but not more than 50 
amino acid substitutions, even more preferably, not more than 40 amino acid 
5 substitutions, still more preferably, not more than 30 amino acid substitutions, and 
still even more preferably, not more than 20 amino acid substitutions. Of course, in 
order of ever-increasing preference, it is highly preferable for a polypeptide to have 
an amino acid sequence which comprises the amino acid sequence of the present 
invention, which contains at least one, but not more than 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1 
10 amino acid substitutions. In specific embodiments, the number of additions, 
substitutions, and/or deletions in the amino acid sequence of the present invention or 
fragments thereof (e.g., the mature form and/or other fragments described herein), is 
1-5, 5-10, 5-25, 5-50, 10-50 or 50-150, conservative amino acid substitutions are 
preferable. 

15 

Polynucleotide and Polypeptide Fragments 

In the present invention, a "polynucleotide fragment" refers to a short 
polynucleotide having a nucleic acid sequence contained in the deposited clone or 
shown in SEQ ID NO:X. The short nucleotide fragments are preferably at least about 

20 15 nt, and more preferably at least about 20 nt, still more preferably at least about 30 
nt, and even more preferably, at least about 40 nt in length. A fragment "at least 20 nt 
in length," for example, is intended to include 20 or more contiguous bases from the 
cDN A sequence contained in the deposited clone or the nucleotide sequence shown in 
SEQ ID NO:X. These nucleotide fragments are useful as diagnostic probes and 

25 primers as discussed herein. Of course, larger fragments (e.g., 50, 150, 500, 600, 
2000 nucleotides) are preferred. 

Moreover, representative examples of polynucleotide fragments of the 

invention, include, for example, fragments having a sequence from jibputjiucleoiide. 

number 1-50, 51-100, 101-150, 151-200, 201-250, 251-300, 301-350, 351-400, 401- 

30 450, 451-500, 50.1-550, 551-600, 651-700, 701-750, 751-800, 800-850, 851-900, 901- 
950, 951-1000, 1001-1050, 1051-1100, 1101-1150, 1151-1200, 1201-1250, 1251- 
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1300, 1301-1350, 1351-1400, 1401-1450, 1451-1500, 1501-1550, 1551-1600, 1601- 
1650, 1651-1700, 1701-1750, 1751-1800, 1801-1850, 1851-1900, 1901-1950, 1951- 
2000, or 2001 to the end of SEQ ID NO:X or the cDNA contained in the deposited 
clone. In this context "about" includes the particularly recited ranges, larger or 
5 smaller by several (5, 4, 3, 2, or 1) nucleotides, at either terminus or at both termini. 
Preferably, these fragments encode a polypeptide which has biological activity. More 
preferably, these polynucleotides can be used as probes or primers as discussed 
herein. 

In the present invention, a "polypeptide fragment" refers to a short amino acid 

10 sequence contained in SEQ ID NO:Y or encoded by the cDNA contained in the 
deposited clone. Protein fragments may be "free-standing," or comprised within a 
larger polypeptide of which the fragment forms a part or region, most preferably as a 
single continuous region. Representative examples of polypeptide fragments of the 
invention, include, for example, fragments from about amino acid number 1-20, 21- 

15 40, 41-60, 61-80, 8-1-100, 102-120 r ;1'2M40, 141-160, or 161 to the end of the coding 
region. Moreover, polypeptide fragments can be about 20, 30, 40, 50, 60, 70, 80, 90, 
100, 110, 120, 130, 140, or 150 amino acids in length. In this context "about" 
includes the particularly recited ranges, larger or smaller by several (5, 4, 3, 2, or 1) 
amino acids, at either extreme or at both extremes. 

20 Preferred polypeptide fragments include the secreted protein as well as the 

mature form. Further preferred polypeptide fragments include the secreted protein or 
the mature form having a continuous series of deleted residues from the amino or the 
carboxy terminus, or both. For example, any number of amino acids, ranging from 1 - 
60, can be deleted from the amino terminus of either the secreted polypeptide or the 

25 mature form. Similarly, any number of amino acids, ranging from 1-30, can be 
deleted from the carboxy terminus of the secreted protein or mature form. 
Furthermore, any combination of the above amino and carboxy terminus deletions are 

prrfeired. ^imilarly, polynucleotide fragments encoding these polypeptide fragments 

are also preferred. 

30 Also preferred are polypeptide and polynucleotide fragments characterized by 

structural or functional domains, such as fragments that comprise alpha-helix and 
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alpha-helix forming regions, beta-sheet and beta-sheet-forming regions, turn and turn- 
forming regions, coil and coil-forming regions, hydrophilic regions, hydrophobic 
regions, alpha amphipathic regions, beta amphipathic regions, flexible regions, 
surface-forming regions, substrate binding region, and high antigenic index regions. 
5 Polypeptide fragments of SEQ ID NO: Y falling within conserved domains are 
specifically contemplated by the present invention. Moreover, polynucleotide 
fragments encoding these domains are also contemplated. 

Other preferred fragments are biologically active fragments. Biologically 
active fragments are those exhibiting activity similar, but not necessarily identical, to 
10 an activity of the polypeptide of the present invention. The biological activity of the 
fragments may include an improved desired activity, or a decreased undesirable 
activity. 

Epitopes & Antibodies 

15 In the present, invention, ■ "-epitopes"- refer to polypeptide fragments having 

antigenic or immunogenic activity in an animal, especially in a human. A preferred 
embodiment of the present invention relates to a polypeptide fragment comprising an 
epitope, as well as the polynucleotide encoding this fragment. A region of a protein 
molecule to which an antibody can bind is defined as an "antigenic epitope." In 

20 contrast, an "immunogenic epitope" is defined as a part of a protein that elicits an 
antibody response. (See, for instance, Geysen et al., Proc. Natl. Acad. Sci. USA 
81:3998-4002(1983).) 

Fragments which function asepitopes may. be produced by any conventional 

means. (See, e.g., Houghten, R. A., Proc. Natl. Acad. Sci. USA 82:5131-5135 (1985) 
25 further described in U.S . Patent No. 4,63 1,211.) 

In the present invention, antigenic epitopes preferably contain a sequence of at 
least seven, more preferably at least nine, and most preferably between about 15 to 

abq^^juiiinojLcMs 

monoclonal antibodies, that specifically bind the epitope. (See, for instance, Wilson 
30 et al., Cell 37:767-778 (1984); Sutcliffe, J. G. et al., Science 219:660-666 (1983).) 
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Similarly, immunogenic epitopes can be used to induce antibodies according 
to methods well known in the art. (See, for instance, Sutcliffe et aL, supra; Wilson et 
al. f supra; Chow, M. et al., Proc. Natl. Acad. Sci. USA 82:910-914; and Bittle, F. J. et 
aL, J, Gen. Virol. 66:2347-2354 (1985).) A preferred immunogenic epitope includes 
the secreted protein. The immunogenic epitopes may be presented together with a 
- carrier protein, such as an albumin, to-an animal system (such as rabbit or mouse) or, 
if it is long enough (at least about 25 amino acids), without a carrier. However, 
immunogenic epitopes comprising as few as 8 to 10 amino acids have been shown to 
be sufficient to raise antibodies capable of binding to, at the very least, linear epitopes 
in a denatured polypeptide (e.g., in Western blotting.) 

As used herein, the term "antibody" (Ab) or "monoclonal antibody" (Mab) is 
meant to include intact molecules as well as antibody fragments (such as, for 
example, Fab and F(ab')2 fragments) which are capable of specifically binding to 
protein. Fab and F(ab')2 fragments lack the Fc fragment of intact antibody, clear 
more rapidly from-the circulation, and-may have less non-specific tissue binding than 
an intact antibody. (Wahl et al., J. NucL Med. 24:316-325 (1983).) Thus, these 
fragments are preferred, as well as the products of a FAB or other immunoglobulin 
expression library. Moreover, antibodies of the present invention include chimeric, 
single chain, and humanized antibodies. 

Fusion Proteins 

Any polypeptide of the present invention can be used to generate fusion 
proteins. For example, the polypeptide of the present invention, when fused to a 
second protein, can be used as an antigenic tag. Antibodies raised against the 
polypeptide of the present invention can be used to indirectly detect the second 
protein by binding to the polypeptide. Moreover, because secreted proteins target 
cellular locations based on trafficking signals, the polypeptides of the present 
invention-can be used as targeting molecules dnce fused to other proteins. 

Examples of domains that can be fused to polypeptides of the present 
invention include not only heterologous signal sequences, but also other heterologous 
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functional regions. The fusion does not necessarily need to be direct, but may occur 
through linker sequences. 

Moreover, fusion proteins may also be engineered to improve characteristics 
of the polypeptide of the present invention. For instance, a region of additional amino 
5 acids, particularly charged amino acids, may be added to the N-terminus of the 

polypeptide to improve stability and persistence during purification from the host cell 
or subsequent handling and storage. Also, peptide moieties may be added to the 
polypeptide to facilitate purification. Such regions may be removed prior to final 
preparation of the polypeptide. The addition of peptide moieties to facilitate handling 

10 of polypeptides are familiar and routine techniques in the art. 

Moreover, polypeptides of the present invention, including fragments, and 
specifically epitopes, can be combined with parts of the constant domain of 
immunoglobulins (IgG), resulting in chimeric polypeptides. These fusion proteins 
facilitate purification and show an increased half-life in vivo. One reported example 

15 describes chimeric proteins consisting of the first two domains of the human CD4- 
polypeptide and various domains of the constant regions of the heavy or light chains 
of mammalian immunoglobulins. (EP A 394,827; Traunecker et al., Nature 331:84- 
86 (1988).) Fusion proteins having disulfide-linked dimeric structures (due to the 
IgG) can also be more efficient in binding and neutralizing other molecules, than the 

20 monomelic secreted protein or protein fragment alone. (Fountoulakis et al., J. 
Biochem. 270:3958-3964 (1995).) 

Similarly, EP-A-O 464 533 (Canadian counterpart 2045869) discloses fusion 
proteins comprising various portions of constant region of immunoglobulin molecules 
together with another human protein or part thereof. In many cases, the Fc part in a 

25 fusion protein is beneficial in therapy and diagnosis, and thus can result in, for 

example, improved pharmacokinetic properties. (EP-A 0232 262.) Alternatively, 
deleting the Fc part after the fusion protein has been expressed, detected, and purified, 
would be "desired. For ex^pieT the F^^oftiolTma^hihder therapy and diaghosuTif 
the fusion protein is used as an antigen for immunizations. In drug discovery, for 

30 example, human proteins, such as hIL-5, have been fused with Fc portions for the 
purpose of high-throughput screening assays to identify antagonists of hIL-5. (See, 
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D. Bennett et al., J. Molecular Recognition 8:52-58 (1995); K. Johanson et al., J. Biol. 
Chem. 270:9459-9471 (1995).) 

Moreover, the polypeptides of the present invention can be fused to marker 
sequences, such as a peptide which facilitates purification of the fused polypeptide. 
In preferred embodiments, the marker amino acid sequence is a hexa-histidine 
peptide, such as the tag provided in a pQE vector (QIAGEN, Inc., 9259 Eton Avenue, 
Chatsworth, CA, 9131 1), among others, many of which are commercially available. 
As described in Gentz et al., Proc. Natl. Acad. Sci. USA 86:821-824 (1989), for 
instance, hexa-histidine provides for convenient purification of the fusion protein. 
Another peptide tag useful for purification, the "HA" tag, corresponds to an epitope 
derived from the influenza hemagglutinin protein. (Wilson et al., Cell 37:767 
(1984).) 

Thus, any of these above fusions can be engineered using the polynucleotides 
or the polypeptides of the present invention. 

Vectors. Host Cells, and Protein Production 

The present invention also relates to vectors containing the polynucleotide of 
the present invention, host cells, and the production of polypeptides by recombinant 
techniques. The vector may be, for example, a phage, plasmid, viral, or retroviral 
vector. Retroviral vectors may be replication competentor replication defective. In 
the latter case, viral propagation generally will occur only in complementing host 
cells. 

The polynucleotides may be joined to a vector containing a selectable marker 
for propagation in a host. Generally, a plasmid vector is introduced in a precipitate, 
such as a calcium phosphate precipitate, or in a complex with a charged lipid. If the 
vector is a virus, it may be packaged in vitro using an appropriate packaging cell line 
and then transduced into host cells. 

The polynucleotide insert should be operati vely linkedTo an" appropriate 

promoter, such as the phage lambda PL promoter, the E. coli lac, trp, phoA and tac 
promoters, the SV40 early and late promoters and promoters of retroviral LTRs, to 
name a few. Other suitable promoters will be known to the skilled artisan. The 
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expression constructs will further contain sites for transcription initiation, termination, 
and, in the transcribed region, a ribosome binding site for translation. The coding 
portion of the transcripts expressed by the constructs will preferably include a 
translation initiating codon at the beginning and a termination codon (UAA, UGA or 
5 UAG) appropriately positioned at the end of the polypeptide to be translated. 

As indicated, the expression vectors will preferably include at least one 
selectable marker. Such markers include dihydrofolate reductase, G418 or neomycin 
resistance for eukaryotic cell culture and tetracycline, kanamycin or ampicillin 
resistance genes for culturing in E. coli and other bacteria. Representative examples 

10 of appropriate hosts include, but are not limited to, bacterial cells, such as E. coli, 
Streptomyces and Salmonella typhimurium cells; fungal cells, such as yeast cells; 
insect cells such as Drosophila S2 and Spodoptera Sf9 cells; animal cells such as 
CHO, COS, 293, and Bowes melanoma cells; and plant cells. Appropriate culture 
mediums and conditions for the above-described host cells are known in the art. 

1 5 Among vectors preferred for use in bacteria include pQE70, pQE60 and pQE- 

9, available from QIAGEN, Inc.; pBluescript vectors, Phagescript vectors, pNH8A, 
pNH16a, pNH18A, pNH46A, available from Stratagene Cloning Systems, Inc.; and 
ptrc99a, pKK223-3, pKK233-3, pDR540, pRIT5 available from Pharmacia Biotech, 
Inc. Among preferred eukaryotic vectors are pWLNEO, pS V2CAT, pOG44, pXTl 

20 and pSG available from Stratagene; and pSVK3, pBPV, pMSG and pSVL available 
from Pharmacia. Other suitable vectors will be readily apparent to the skilled artisan. 

Introduction of the construct into the host cell can be effected by calcium 
phosphate transfection, DEAE-dextran mediated transfection, cationic lipid-mediated 
transfection, electroporation, transduction, infection, or other methods. Such methods 

25 are described in many standard laboratory manuals, such as Davis et al., Basic 
Methods In Molecular Biology (1986). It is specifically contemplated that the 
polypeptides of the present invention may in fact be expressed by a host cell lacking a 

recombinant vector. 

A polypeptide of this invention can be recovered and purified from 

30 recombinant cell cultures by well-known methods including ammonium sulfate or 
ethanol precipitation, acid extraction, anion or cation exchange chromatography, 
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phosphocellulose chromatography, hydrophobic interaction chromatography, affinity 
chromatography, hydroxylapatite chromatography and lectin chromatography. Most 
preferably, high performance liquid chromatography ("HPLC") is employed for 
purification. 

5 Polypeptides of the present invention, and preferably the secreted form, can 

also be recovered from: products purified from natural sources, including bodily 
fluids, tissues and cells, whether directly isolated or cultured; products of chemical 
synthetic procedures; and products produced by recombinant techniques from a 
prokaryotic or eukaryotic host, including, for example, bacterial, yeast, higher plant, 

10 insect, and mammalian cells. Depending upon the host employed in a recombinant 
production procedure, the polypeptides of the present invention may be glycosylated 
or may be non-glycosylated. In addition, polypeptides of the invention may also 
include an initial modified methionine residue, in some cases as a result of host- 
mediated processes. Thus, it is well known in the art that the N-terminal methionine 

15 encoded by the translation initiation codon generally is removed with high efficiency 
from any protein after translation in all eukaryotic cells. While the N-terminal 
methionine on most proteins also is efficiently removed in most prokaryotes, for some 
proteins, this prokaryotic removal process is inefficient, depending on the nature of 
the amino acid to which the N-terminal methionine is covalently linked. 

20 In addition to encompassing host cells containing the vector constructs 

discussed herein, the invention also encompasses primary, secondary, and 
immortalized host cells of vertebrate origin, particularly mammalian origin, that have 
been engineered to delete or replace endogenous genetic material (e.g., coding 
sequence), and/or to include genetic material (e.g., heterologous polynucleotide 

25 sequences) that is operably associated with the polynucleotides of the invention, and 
which activates, alters, and/or amplifies endogenous polynucleotides. For example, 
techniques known in the art may be used to operably associate heterologous control 

regions (e.g., -promoter and/or-enhancer)-andendogenous~polyn^^ 

via homologous recombination (see, e.g., U.S. Patent No. 5,641,670, issued June 24, 

30 1997; International Publication No. WO 96/2941 1, published September 26, 1996; 
International Publication No. WO 94/12650, published August 4, 1994; Koller et al., 
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Proc. Natl. Acad. Sci. USA 86:8932-8935 (1989); and Zijlstra et al. t Nature 342:435- 
438 (1989), the disclosures of each of which are incorporated by reference in their 
entireties). 



5 

Uses of the Polynucleotides 

Each of the polynucleotides identified herein can be used in numerous ways as 
reagents. The following description should be considered exemplary and utilizes 
known techniques. 

10 The polynucleotides of the present invention are useful for chromosome 

identification. There exists an ongoing need to identify new chromosome markers, 
since few chromosome marking reagents, based on actual sequence data (repeat 
polymorphisms), are presently available. Each polynucleotide of the present 
invention can be used as a chromosome marker. 

15 Briefly, sequences can, be mapped to chromosomes by preparing PCR primers 

(preferably 1 5-25 bp) from the sequences shown in SEQ ID NO:X. Primers can be 
selected using computer analysis so that primers do not span more than one predicted 
exon in the genomic DNA. These primers are then used for PCR screening of 
somatic cell hybrids containing individual human chromosomes. Only those hybrids 

20 containing the human gene corresponding to the SEQ ID NO:X will yield an 
amplified fragment. 

Similarly, somatic hybrids provide a rapid method of PCR mapping the 
polynucleotides to particular chromosomes. Three or more clones can be assigned per 
day using a single thermal cycler. Moreover, sublocalization of the polynucleotides 

25 can be achieved with panels of specific chromosome fragments. Other gene mapping 
strategies that can be used include in situ hybridization, prescreening with labeled 
flow-sorted chromosomes, and preselection by hybridization to construct 

chromosome specific=cDNA-librarieSr 

Precise chromosomal location of the polynucleotides can also be achieved 

30 using fluorescence in situ hybridization (FISH) of a metaphase chromosomal spread. 
This technique uses polynucleotides as short as 500 or 600 bases; however, 
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polynucleotides 2,000-4,000 bp are preferred. For a review of this technique, see 
Verma et al., "Human Chromosomes: a Manual of Basic Techniques," Pergamon 
Press, New York (1988). 

For chromosome mapping, the polynucleotides can be used individually (to 
mark a single chromosome or a single site on that chromosome) or in panels (for 
marking multiple sites and/or multiple chromosomes). Preferred polynucleotides 
correspond to the noncoding regions of the cDNAs because the coding sequences are 
more likely conserved within gene families, thus increasing the chance of cross 
hybridization during chromosomal mapping. 

Once a polynucleotide has been mapped to a precise chromosomal location, 
the physical position of the polynucleotide can be used in linkage analysis. Linkage 
analysis establishes coinheritance between a chromosomal location and presentation 
of a particular disease. (Disease mapping data are found, for example, in V. 
McKusick, Mendelian Inheritance in Man (available on line through Johns Hopkins 
University Welch Medical Library) .) "Assuming 1 megabase mapping resolution and 
one gene per 20 kb, a cDNA precisely localized to a chromosomal region associated 
with the disease could be one of 50-500 potential causative genes. 

Thus, once coinheritance is established, differences in the polynucleotide and 
the corresponding gene between affected and unaffected individuals can be examined. 
First, visible structural alterations in the chromosomes, such as deletions or 
translocations, are examined in chromosome spreads or by PCR. If no structural 
alterations exist, the presence of point mutations are ascertained. Mutations observed 
in some or all affected individuals, but not in normal individuals, indicates that the 
mutation may cause the disease. However, complete sequencing of the polypeptide 
and the corresponding gene from several normal individuals is required to distinguish 
the mutation from a polymorphism. If a new polymorphism is identified, this 
polymorphic polypeptide can be used for further linkage analysis. 

Fuithennore, increased. or decreased-expressionof thegeneiiTaffecteci 

individuals as compared to unaffected individuals can be assessed using 
polynucleotides of the present invention. Any of these alterations (altered expression, 
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chromosomal rearrangement, or mutation) can be used as a diagnostic or prognostic 
marker. 

In addition to the foregoing, a polynucleotide can be used to control gene 
expression through triple helix formation or antisense DNA or RNA. Both methods 
5 rely on binding of the polynucleotide to DNA or RNA. For these techniques, 

preferred polynucleotides are usually 20 to 40 bases in length and complementary to 
either the region of the gene involved in transcription (triple helix - see Lee et al., 
Nucl. Acids Res. 6:3073 (1979); Cooney et al., Science 241:456 (1988); and Dervan 
et al., Science 251:1360 (1991) ) or to the mRNA itself (antisense - Okano, J. 

10 Neurochem. 56:560 (1991); Oligodeoxy-nucleotides as Antisense Inhibitors of Gene 
Expression, CRC Press, Boca Raton, FL (1988).) Triple helix formation optimally 
results in a shut-off of RNA transcription from DNA, while antisense RNA 
hybridization blocks translation of an mRNA molecule into polypeptide. Both 
techniques are effective in model systems, and the information disclosed herein can 

15 be used to design antisense or triple helix polynucleotides in an effort to treat disease. 

Polynucleotides of the present invention are also useful in gene therapy. One 
goal of gene therapy is to insert a normal gene into an organism having a defective 
gene, in an effort to correct the genetic defect. The polynucleotides disclosed in the 
present invention offer a means of targeting such genetic defects in a highly accurate 

20 manner. Another goal is to insert a new gene that was not present in the host genome, 
thereby producing a new trait in the host cell. 

The polynucleotides are also useful for identifying individuals from minute 
biological samples.. The United States military, for example, is considering the use of 
restriction fragment length polymorphism (RFLP) for identification of its personnel. 

25 In this technique, an individual's genomic DNA is digested with one or more 
restriction enzymes, and probed on a Southern blot to yield unique bands for 
identifying personnel. This method does not suffer from the current limitations of 

_"Dog_Tagsl which can be lost, switehedror stolenrmaking positiATeTdentificauoh 

difficult. The polynucleotides of the present invention can be used as additional DNA 

30 markers for RFLP. 
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The polynucleotides of the present invention can also be used as an alternative 
to RFLP, by determining the actual base-by-base DNA sequence of selected portions 
of an individual's genome. These sequences can be used to prepare PCR primers for 
amplifying and isolating such selected DNA, which can then be sequenced. Using 
this technique, individuals can be identified because each individual will have a 
unique set of DNA sequences. Once an unique ID database is established for an 
individual, positive identification of that individual, living or dead, can be made from 
extremely small tissue samples. 

Forensic biology also benefits from using DNA-based identification 
techniques as disclosed herein. DNA sequences taken from very small biological 
samples such as tissues, e.g., hair or skin, or body fluids, e.g., blood, saliva, semen, 
etc., can be amplified using PCR. In one prior art technique, gene sequences 
amplified from polymorphic loci, such as DQa class II HLA gene, are used in forensic 
biology to identify individuals. (Erlich, H., PCR Technology, Freeman and Co. 
(1992).) Once these~specific polymorphic loci are amplified, they are digested with 
one or more restriction enzymes, yielding an identifying set of bands on a Southern 
blot probed with DNA corresponding to the DQa class II HLA gene. Similarly, 
polynucleotides of the present invention can be used as polymorphic markers for 
forensic purposes. 

There is also a need for reagents capable of identifying the source of a 
particular tissue. Such need arises, for example, in forensics when presented with 
tissue of unknown origin. Appropriate reagents can comprise, for example, DNA 
probes or primers specific to particular tissue prepared from the sequences of the 
present invention. Panels of such reagents can identify tissue by species and/or by 
organ type. In a similar fashion, these reagents can be used to screen tissue cultures 
for contamination. 

In the very least, the polynucleotides of the present invention can be used as 
^molecular weight-markers on Southern-gelsras diagnostic'prdbes for th<T]^sence~of aT 
specific mRNA in a particular cell type, as a probe to "subtract-out" known sequences 
in the process of discovering novel polynucleotides, for selecting and making 
oligomers for attachment to a "gene chip" or other support, to raise anti-DNA 
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antibodies using DNA immunization techniques, and as an antigen to elicit an 
immune response. 

Uses of the Polypeptides 

5 Each of the polypeptides identified herein can be used in numerous ways. The 

following description should be considered exemplary and utilizes known techniques. 

A polypeptide of the present invention can be used to assay protein levels in a 
biological sample using antibody-based techniques. For example, protein expression 
in tissues can be studied with classical immunohistological methods. (Jalkanen, M., 

10 et al. t J. Cell. Biol. 101:976-985 (1985); Jalkanen, M., et al., J. Cell . Biol. 105:3087- 
3096 (1987).) Other antibody-based methods useful for detecting protein gene 
expression include immunoassays, such as the enzyme linked immunosorbent assay 
(ELIS A) and the radioimmunoassay (RIA). Suitable antibody assay labels are known 
in the art and include enzyme labels, such as, glucose oxidase, and radioisotopes, such 

15 as iodine (1251, 1211), carbon (14C), sulfur (35S), tritium (3H), indium (1 12In), and 
technetium (99mTc), and fluorescent labels, such as fluorescein and rhodamine, and 
biotin. 

In addition to assaying secreted protein levels in a biological sample, proteins 
can also be detected in vivo by imaging. Antibody labels or markers for in vivo 

20 imaging of protein include those detectable by X-radiography, NMR or ESR. For X- 
radiography, suitable labels include radioisotopes such as barium or cesium, which 
emit detectable radiation but are not overtly harmful to the subject. Suitable markers 
for NMR and ESR include those with a detectable characteristic spin, such as 
deuterium, which may be incorporated into the antibody by labeling of nutrients for 

25 the relevant hybridoma. 

A protein-specific antibody or antibody fragment which has been labeled with 
an appropriate detectable imaging moiety, such as a radioisotope (for example, 1311, 

J 12In,_99mTc),_a-radio=opaque-substanee r or-a material detectable by nuclear 

magnetic resonance, is introduced (for example, parenterally, subcutaneously, or 

30 intraperitoneally) into the mammal. It will be understood in the art that the size of the 
subject and the imaging system used will determine the quantity of imaging moiety 
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needed to produce diagnostic images. In the case of a radioisotope moiety, for a 
human subject, the quantity of radioactivity injected will normally range from about 5 
to 20 millicuries of 99mTc. The labeled antibody or antibody fragment will then 
preferentially accumulate at the location of cells which contain the specific protein. 
In vivo tumor imaging is described in S.W. Burchiel et al., "Immunopharmacokinetics 
of Radiolabeled Antibodies and their Fragments." (Chapter 13 in Tumor Imaging: 
The Radiochemical Detection of Cancer, S.W. Burchiel and B. A. Rhodes, eds., 
Masson Publishing Inc. (1982).) 

Thus, the invention provides a diagnostic method of a disorder, which 
involves (a) assaying the expression of a polypeptide of the present invention in cells 
or body fluid of an individual; (b) comparing the level of gene expression with a 
standard gene expression level, whereby an increase or decrease in the assayed 
polypeptide gene expression level compared to the standard expression level is 
indicative of a disorder. 

Moreover, polypeptides of the present invention can be used to treat disease. 
For example, patients can be administered a polypeptide of the present invention in an 
effort to replace absent or decreased levels of the polypeptide (e.g., insulin), to 
supplement absent or decreased levels of a different polypeptide (e.g., hemoglobin S 
for hemoglobin B), to inhibit the activity of a polypeptide (e.g., an oncogene), to 
activate the activity of a polypeptide (e.g., by binding to a receptor), to reduce the 
activity of a membrane bound receptor by competing with it for free ligand (e.g., 
soluble TNF receptors used in reducing inflammation), or to bring about a desired 
response (e.g., blood vessel growth). 

Similarly, antibodies directed to a polypeptide of the present invention can 
also be used to treat disease. For example, administration of an antibody directed to a 
polypeptide of the present invention can bind and reduce overproduction of the 
polypeptide. Similarly, administration of an antibody can activate the polypeptide, 
such as by binding-to-a-polypeptide bound to a membrane (receptor): 

At the very least, the polypeptides of the present invention can be used as 
molecular weight markers on SDS-PAGE gels or on molecular sieve gel filtration 
columns using methods well known to those of skill in the art. Polypeptides can also 
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be used to raise antibodies, which in turn are used to measure protein expression from 
a recombinant cell, as a way of assessing transformation of the host cell. Moreover, 
the polypeptides of the present invention can be used to test the following biological 
activities. 

Biological Activities 

The polynucleotides and polypeptides of the present invention can be used in 
assays to test for one or more biological activities. If these polynucleotides and 
polypeptides do.exhibit activity in a particular assay, it is likely that these molecules 
may be involved in the diseases associated with the biological activity. Thus, the 
polynucleotides and polypeptides could be used to treat the associated disease. 

Immune Activity 

A polypeptide or polynucleotide of the present invention may be useful in 
treating deficiencies or disorders of the immune system, by activating or inhibiting the 
proliferation, differentiation, or mobilization (chemotaxis) of immune cells. Immune 
cells develop through a process called hematopoiesis, producing myeloid (platelets, 
red blood cells, neutrophils, and macrophages) and lymphoid (B and T lymphocytes) 
cells from pluripotent stem cells. The etiology of these immune deficiencies or 
disorders may be genetic, somatic, such as cancer or some autoimmune disorders, 
acquired (e.g., by chemotherapy or toxins), or infectious. Moreover, a polynucleotide 
or polypeptide of the present invention can be used as a marker or detector of a 
particular immune system disease or disorder. 

A polynucleotide or polypeptide of the present invention may be useful in 
treating or detecting deficiencies or disorders of hematopoietic cells. A 
polypeptide or polynucleotide of the present invention could be used to increase 
differentiation and proliferation of hematopoietic cells, including the pluripotent stem 
-cellSf-in an-ef f ort-to treat thosedisorders associated witlT a decrease in certahTfor 
many) types hematopoietic cells. Examples of immunologic deficiency syndromes 
include, but are not limited to: blood protein disorders (e.g. agammaglobulinemia, 
dysgammaglobulinemia), ataxia telangiectasia, common variable immunodeficiency, 
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Digeorge Syndrome, HIV infection, HTLV-BLV infection, leukocyte adhesion 
deficiency syndrome, lymphopenia, phagocyte bactericidal dysfunction, severe 
combined immunodeficiency (SCIDs), WiskottrAldrich Disorder, anemia, 
thrombocytopenia, or hemoglobinuria. 

Moreover, a polypeptide or polynucleotide of the present invention could also 
be used to modulate hemostatic (the stopping of bleeding) or thrombolytic activity 
(clot formation). For example, by increasing hemostatic or thrombolytic activity, a 
polynucleotide or polypeptide of the present invention could be used to treat blood 
coagulation disorders (e.g., afibrinogenemia, factor deficiencies), blood platelet 
disorders (e.g. thrombocytopenia), or wounds resulting from trauma, surgery, or other 
causes. Alternatively, a polynucleotide or polypeptide of the present invention that 
can decrease hemostatic or thrombolytic activity could be used to inhibit or dissolve 
clotting. These molecules could be important in the treatment of heart attacks 
(infarction), strokes, or scarring. 

A polynucleotide or polypeptide of the present invention may also be useful in 
treating or detecting autoimmune disorders. Many autoimmune disorders result from 
inappropriate recognition of self as foreign material by immune cells. This 
inappropriate recognition results in an immune response leading to the destruction of 
the host tissue. Therefore, the administration of a polypeptide or polynucleotide of the 
present invention that inhibits an immune response, particularly the proliferation, 
differentiation, or chemotaxis of T-cells, may be an effective therapy in preventing 
autoimmune disorders. 

. Examples of autoimmune disorders that can be treated or detected by the 
present invention include, but are not limited to: Addison's Disease, hemolytic 
anemia, antiphospholipid syndrome, rheumatoid arthritis, dermatitis, allergic 
encephalomyelitis, glomerulonephritis, Goodpasture's Syndrome, Graves' Disease, 
Multiple Sclerosis, Myasthenia Gravis, Neuritis, Ophthalmia, Bullous Pemphigoid, 
Pemphigus, PolyendocrinopathiesrPurpuraT Reiter TDiseaseT Stiff -Man Sy ndrome7 ~ 
Autoimmune Thyroiditis, Systemic Lupus Erythematosus, Autoimmune Pulmonary 
Inflammation, Guillain-Barre Syndrome, insulin dependent diabetes mellitis, and 
autoimmune inflammatory eye disease. 
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Similarly, allergic reactions and conditions, such as asthma (particularly 
allergic asthma) or other respiratory problems, may also be treated by a polypeptide 
or polynucleotide of the present invention. Moreover, these molecules can be used to 
treat anaphylaxis, hypersensitivity to an antigenic molecule, or blood group 
incompatibility. 

A polynucleotide or polypeptide of the present invention may also be used to 
treat and/or prevent organ rejection or graft- versus-host disease (GVHD). Organ 
rejection occurs by host immune cell destruction of the transplanted tissue through an 
immune response. Similarly, an immune response is also involved in GVHD, but, in 
this case, the foreign transplanted immune cells destroy the host tissues. The 
administration of a polypeptide or polynucleotide of the present invention that inhibits 
an immune response, particularly the proliferation, differentiation, or chemotaxis of 
T-cells, may be an effective therapy in preventing organ rejection or GVHD. 

Similarly, a polypeptide or polynucleotide of the present invention may also 
be used to modulate mflammatiotl.. For example, the polypeptide or polynucleotide 
may inhibit the proliferation and differentiation of cells involved in an inflammatory 
response. These molecules can be used to treat inflammatory conditions, both chronic 
and acute conditions, including inflammation associated with infection (e.g., septic 
shock, sepsis, or systemic inflammatory response syndrome (SIRS)), ischemia- 
reperfusion injury, endotoxin lethality, arthritis, complement-mediated hyperacute 
rejection, nephritis, cytokine or chemokine induced lung injury, inflammatory bowel 
disease, Crohn's disease, or resulting from over production of cytokines (e.g., TNF or 
IL-1.) 

Hvperproliferative Disorders 

A polypeptide or polynucleotide can be used to treat or detect 
hyperproliferative disorders, including neoplasms. A polypeptide or polynucleotide 
-of the present invention may inhibit the^roliferation of the disorder through director 
indirect interactions. Alternatively, a polypeptide or polynucleotide of the present 
invention may proliferate other cells which can inhibit the hyperproliferative disorder. 
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For example, by increasing an immune response, particularly increasing 
antigenic qualities of the hypeiproliferative disorder or by proliferating, 
differentiating, or mobilizing T-cells, hypeiproliferative disorders can be treated. 
This immune response may be increased by either enhancing an existing immune 
response, or by initiating a new immune response. Alternatively, decreasing an 
immune response may also be a method of treating hyperproliferative disorders, such 
as a chemotherapeutic agent. 

Examples of hyperproliferative disorders that can be treated or detected by a 
polynucleotide or polypeptide of the present invention include, but are not limited to 
neoplasms located in the: abdomen, bone, breast, digestive system, liver, pancreas, 
peritoneum, endocrine glands (adrenal, parathyroid, pituitary, testicles, ovary, thymus, 
thyroid), eye, head and neck, nervous (central and peripheral), lymphatic system, 
pelvic, skin, soft tissue, spleen, thoracic, and urogenital. 

Similarly, other hyperproliferative disorders can also be treated or detected by 
a polynucleotide of "polypeptide of the present invention. Examples of such 
hyperproliferative disorders include, but are not limited to: 

hypergammaglobulinemia, lymphoproliferative disorders, paraproteinemias, purpura, 
sarcoidosis, Sezary Syndrome, Waldenstron's Macroglobulinemia, Gaucher's 
Disease, histiocytosis, and any other hyperproliferative disease, besides neoplasia, 
located in an organ system listed above. 

Infectious Disease 

A polypeptide or polynucleotide of the present invention can be used to treat 
or detect infectious agents. For example, by increasing the immune response, 
particularly increasing the proliferation and differentiation of B and/or T cells, 
infectious diseases may be treated. The immune response may be increased by either 
enhancing an existing immune response, or by initiating a new immune response. 
_ Alternatively, the polypeptide or polynucleotide" of the present invention majT aTscT 
directly inhibit the infectious agent, without necessarily eliciting an immune response. 

Viruses are one example of an infectious agent that can cause disease or 
symptoms that can be treated or detected by a polynucleotide or polypeptide of the 
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present invention. Examples of viruses, include, but are not limited to the following 
DNA and RNA viral families: Arbovirus, Adenoviridae, Arenaviridae, Arterivirus, 
Birnaviridae, Bunyaviridae, Caliciviridae, Circoviridae, Coronaviridae, Flaviviridae, 
Hepadnaviridae (Hepatitis), Herpesviridae (such as, Cytomegalovirus, Herpes 
5 Simplex, Herpes Zoster), Mononegavirus (e.g., Paramyxoviridae, Morbilli virus, 
Rhabdoviridae), Orthomyxoviridae (e.g., Influenza), Papovaviridae, Parvoviridae, 
Picornaviridae, Poxviridae (such as Smallpox or Vaccinia), Reoviridae (e.g., 
Rotavirus), Retro viridae (HTLV-I, HTLV-II, Lenti virus), and Togaviridae (e.g., 
Rubivirus). Viruses falling within these families can cause a variety of diseases or 

10 symptoms, including, but not limited to: arthritis, bronchiollitis, encephalitis, eye 

infections (e.g., conjunctivitis, keratitis), chronic fatigue syndrome, hepatitis (A, B, C, 
E, Chronic Active, Delta), meningitis, opportunistic infections (e.g., AIDS), 
pneumonia, Burkitt's Lymphoma, chickenpox , hemorrhagic fever, Measles, Mumps, 
Parainfluenza, Rabies, the common cold, Polio, leukemia, Rubella, sexually 

15 transmitted diseases, skin diseases (e.g., Kaposi's, warts); and viremia. A polypeptide 
or polynucleotide of the present invention can be used to treat or detect any of these 
symptoms or diseases. 

Similarly, bacterial or fungal agents that can cause disease or symptoms and 
that can be treated or detected by a polynucleotide or polypeptide of the present 

20 invention include, but not limited to, the following Gram-Negative and Gram-positive 
bacterial families and fungi: Actinomycetales (e.g., Corynebacterium, 
Mycobacterium, Norcardia), Aspergillosis, Bacillaceae (e.g., Anthrax, Clostridium), 
Bacteroidaceae, Blastomycosis, Bordetella, Borrelia, Brucellosis, Candidiasis, 
Campylobacter, Coccidioidomycosis, Cryptococcosis, Dermatocy coses, 

25 Enterobacteriaceae (Klebsiella, Salmonella, Serratia, Yersinia), Erysipelothrix, 

Helicobacter, Legionellosis, Leptospirosis, Listeria, Mycoplasmatales, Neisseriaceae 
(e.g., Acinetobacter, Gonorrhea, Menigococcal), Pasteurellacea Infections (e.g., 

-ActinobaeillusrHeamophilus; Pasteureira)7Ps^36m6h^,~lGckettsiaceae, 

Chlamydiaceae, Syphilis, and Staphylococcal. These bacterial or fungal families can 

30 cause the following diseases or symptoms, including, but not limited to: bacteremia, 
endocarditis, eye infections (conjunctivitis, tuberculosis, uveitis), gingivitis, 
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opportunistic infections (e.g., AIDS related infections), paronychia, prosthesis-related 
infections, Reiter's Disease, respiratory tract infections, such as Whooping Cough or 
Empyema, sepsis, Lyme Disease, Cat-Scratch Disease, Dysentery, Paratyphoid Fever, 
food poisoning, Typhoid, pneumonia, Gonorrhea, meningitis, Chlamydia, Syphilis, 
Diphtheria, Leprosy, Paratuberculosis, Tuberculosis, Lupus, Botulism, gangrene, 
tetanus, impetigo, Rheumatic Fever, Scarlet Fever, sexually transmitted diseases, skin 
diseases (e.g., cellulitis, dermatocycoses), toxemia, urinary tract infections, wound 
infections. A polypeptide or polynucleotide of the present invention can be used to 
treat or detect any of these symptoms or diseases. 

Moreover, parasitic agents causing disease or symptoms that can be treated or 
detected by a polynucleotide or polypeptide of the present invention include, but not 
limited to, the following families: Amebiasis, Babesiosis, Coccidiosis, 
Cryptosporidiosis, Dientamoebiasis, Dourine, Ectoparasitic, Giardiasis, 
Helminthiasis, Leishmaniasis, Theileriasis, Toxoplasmosis, Trypanosomiasis, and 
Trichomonas. The'se parasites can . cause a variety of diseases or symptoms, including, 
but not limited to: Scabies, Trombiculiasis, eye infections, intestinal disease (e.g., 
dysentery, giardiasis), liver disease, lung disease, opportunistic infections (e.g., AIDS 
related), Malaria, pregnancy complications, and toxoplasmosis. A polypeptide or 
polynucleotide of the present invention can be used to treat or detect any of these 
symptoms or diseases. 

Preferably, treatment using a polypeptide or polynucleotide of the present 
invention could either be by administering an effective amount of a polypeptide to the 
patient, or by removing cells from the patient, supplying the cells with a 
polynucleotide of the present invention, and returning the engineered cells to the 
patient (ex vivo therapy). Moreover, the polypeptide or polynucleotide of the present 
invention can be used as an antigen in a vaccine to raise an immune response against 
infectious disease. 



Regeneration 

A polynucleotide or polypeptide of the present invention can be used to 
differentiate, proliferate, and attract cells, leading to the regeneration of tissues. (See, 
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Science 276:59-87 (1997).) The regeneration of tissues could be used to repair, 
replace, or protect tissue damaged by congenital defects, trauma (wounds, burns, 
incisions, or ulcers), age, disease (e.g. osteoporosis, osteocarthritis, periodontal 
disease, liver failure), surgery, including cosmetic plastic surgery, fibrosis, 
5 reperfusion injury, or systemic cytokine damage. 

Tissues that could be regenerated using the present invention include organs 
(e.g., pancreas, liver, intestine, kidney, skin, endothelium), muscle (smooth, skeletal 
or cardiac), vasculature (including vascular and lymphatics), nervous, hematopoietic, 
and skeletal (bone, cartilage, tendon, and ligament) tissue. Preferably, regeneration 
10 occurs without or decreased scarring. Regeneration also may include angiogenesis. 

Moreover, a polynucleotide or polypeptide of the present invention may 
increase regeneration of tissues difficult to heal. For example, increased 
tendon/ligament regeneration would quicken recovery time after damage. A 
polynucleotide or polypeptide of the present invention could also be used 
15 prophylactically in~an effort to avoid damage. Specific diseases that could be treated 
include of tendinitis, carpal tunnel syndrome, and other tendon or ligament defects. A 
further example of tissue regeneration of non-healing wounds includes pressure 
ulcers, ulcers associated with vascular insufficiency, surgical, and traumatic wounds. 
Similarly, nerve and brain tissue could also be regenerated by using a 
20 polynucleotide or polypeptide of the present invention to proliferate and differentiate 
nerve cells. Diseases that could be treated using this method include central and 
peripheral nervous system diseases, neuropathies, or mechanical and traumatic 
: disorders (e.g., .spinal cord disorders, head trauma, cerebrovascular disease, arid 
stoke). Specifically, diseases associated with peripheral nerve injuries, peripheral 
25 neuropathy (e.g., resulting from chemotherapy or other medical therapies), localized 
neuropathies, and central nervous system diseases (e.g., Alzheimer's disease, 
Parkinson's disease, Huntington's disease, amyotrophic lateral sclerosis, and Shy- 

Drager„syndrome),xould all-be-treated using the polynucleotide or pblypeptide^fthe 

present invention. 

30 

Chemotaxis 
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A polynucleotide or polypeptide of the present invention may have 
chemotaxis activity. A chemotaxic molecule attracts or mobilizes cells (e.g., 
monocytes, fibroblasts, neutrophils, T-cells, mast cells, eosinophils, epithelial and/or 
endothelial cells) to a particular site in the body, such as inflammation, infection, or 
5 site of hyperproliferation. The mobilized cells can then fight off and/or heal the 
particular trauma or abnormality. 

A polynucleotide or polypeptide of the present invention may increase 
chemotaxic activity of particular cells. These chemotactic molecules can then be used 
to treat inflammation, infection, hyperproliferative disorders, or any immune system 
10 disorder by increasing the number of cells targeted to a particular location in the body. 
For example, chemotaxic molecules can be used to treat wounds and other trauma to 
tissues by attracting immune cells to the injured location. Chemotactic molecules of 
the present invention can also attract fibroblasts, which can be used to treat wounds. 

It is also contemplated that a polynucleotide or polypeptide of the present 
15 invention may inhibit chemotactic "activity. These molecules could also be used to 
treat disorders. Thus, a polynucleotide or polypeptide of the present invention could 
be used as an inhibitor of chemotaxis. 

Binding Activity 

20 A polypeptide of the present invention may be used to screen for molecules 

that bind to the polypeptide or for molecules to which the polypeptide binds. The 
binding of the polypeptide and the molecule may activate (agonist), increase, inhibit 
(antagonist), or decrease activity of the polypeptide or the molecule bound. Examples 
of such molecules include antibodies, oligonucleotides, proteins (e.g., receptors),or 

25 small molecules. 

Preferably, the molecule.is closely related to the natural ligand of the 
polypeptide, e.g., a fragment of the ligand, or a natural substrate, a ligand, a structural 

or functional-mimetic .-(SeerGoH^ 

l(2):Chapter 5 (1991).) Similarly, the molecule can be closely related to the natural 

30 receptor to which the polypeptide binds, or at least, a fragment of the receptor capable 
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of being bound by the polypeptide (e.g., active site). In either case, the molecule can 
be rationally designed using known techniques. 

Preferably, the screening for these molecules involves producing appropriate 
cells which express the polypeptide, either as a secreted protein or on the cell 

5 membrane. Preferred cells include cells from mammals, yeast, Drosophila, or £. coli. 
Cells expressing the polypeptide (or cell membrane containing the expressed 
polypeptide) are then preferably contacted with a test compound potentially 
containing the molecule to observe binding, stimulation, or inhibition of activity of 
either the polypeptide or the molecule. 

10 The assay may simply test binding of a candidate compound to the 

polypeptide, wherein binding is detected by a label, or in an assay involving 
competition with a labeled competitor. Further, the assay may test whether the 
candidate compound results in a signal generated by binding to the polypeptide. 
Alternatively, the assay.can be carried out using cell-free preparations, 

15 polypeptide/molecule affixed to a solid support, chemical libraries, or natural product 
mixtures. The assay may also simply comprise the steps of mixing a candidate 
compound with a solution containing a polypeptide, measuring polypeptide/molecule 
activity or binding, and comparing the polypeptide/molecule activity or binding to a 
standard. 

20 Preferably, an ELISA assay can measure polypeptide level or activity in a 

sample (e.g., biological sample) using a monoclonal or polyclonal antibody. The 
antibody can measure polypeptide level or activity by either binding, directly or 
indirectly, to the polypeptide or by competing with the polypeptide for a substrate. 
All of these above assays can be used as diagnostic or prognostic markers. 

25 The molecules discovered using these assays can be used to treat disease or to bring 
about a particular result in a patient (e.g., blood vessel growth) by activating or 
inhibiting the polypeptide/molecule. Moreover, the assays can discover agents which 

may inhibit orenhance the produc tiorTof the polypepade^fFom suitably manipulated 

cells or tissues. 

30 Therefore, the invention includes a method of identifying compounds which 

bind to a polypeptide of the invention comprising the steps of: (a) incubating a 
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candidate binding compound with a polypeptide of the invention; and (b) determining 
if binding has occurred. Moreover, the invention includes a method of identifying 
agonists/antagonists comprising the steps of: (a) incubating a candidate compound 
with a polypeptide of the invention, (b) assaying a biological activity , and (b) 
determining if a biological activity of the polypeptide has been altered. 

Other Activities 

A polypeptide or polynucleotide of the present invention may also increase or 
decrease the differentiation or proliferation of embryonic stem cells, besides, as 
discussed above, hematopoietic lineage. 

A polypeptide or polynucleotide of the present invention may also be used to 
modulate mammalian characteristics, such as body height, weight, hair color, eye 
color, skin, percentage of adipose tissue, pigmentation, size, and shape (e.g., cosmetic 
surgery). Similarly, a polypeptide or polynucleotide of the present invention may be 
used to modulate mammalian metabolism affecting catabolism, anabolism, 
processing, utilization, and storage of energy. 

A polypeptide or polynucleotide of the present invention may be used to 
change a mammal's mental state or physical state by influencing biorhythms, 
caricadic rhythms, depression (including depressive disorders), tendency for violence, 
tolerance for pain, reproductive capabilities (preferably by Activin or Inhibin-like 
activity), hormonal or endocrine levels, appetite, libido, memory, stress, or other 
cognitive qualities. 

A polypeptide or polynucleotide of the present invention may also be used as a 
food additive or preservative, such as to increase or decrease storage capabilities, fat 
content, lipid, protein, carbohydrate, vitamins, minerals, cofactors or other nutritional 
components. 

Other Preferred Embodiments 

Other preferred embodiments of the claimed invention include an isolated 
nucleic acid molecule comprising a nucleotide sequence which is at least 95% 
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identical to a sequence of at least about 50 contiguous nucleotides in the nucleotide 
sequence of SEQ ID NO:X wherein X is any integer as defined in Table 1. 

Also preferred is a nucleic acid molecule wherein said sequence of contiguous 
nucleotides is included in the nucleotide sequence of SEQ ID NO:X in the range of 
5 positions beginning with the nucleotide at about the position of the 5' Nucleotide of 
the Clone Sequence and ending with the nucleotide at about the position of the 3' 
Nucleotide of the Clone Sequence as defined for SEQ ID NO:X in Table 1. 

Also preferred is a nucleic acid molecule wherein said sequence of contiguous 
nucleotides is included in the nucleotide sequence of SEQ ID NO:X in the range of 

10 positions beginning with the nucleotide at about the position of the 5' Nucleotide of 
the Start Codon and ending with the nucleotide at about the position of the 3' 
Nucleotide of the Clone Sequence as defined for SEQ ID NO:X in Table 1. 

Similarly preferred is a nucleic acid molecule wherein said sequence of 
contiguous nucleotides is included in the nucleotide sequence of SEQ ID NO:X in the 

15 range of positions beginning with the nucleotide at about the position of the 5* 
Nucleotide of the First Amino Acid of the Signal Peptide and ending with the 
nucleotide at about the position of the 3' Nucleotide of the Clone Sequence as defined 
for SEQ ID NO:X in Table 1 . 

Also preferred is an isolated nucleic acid molecule comprising a nucleotide 

20 sequence which is at least 95% identical to a sequence of at least about 150 
contiguous nucleotides in the nucleotide sequence of SEQ ID NO:X. 

Further preferred is an isolated nucleic acid molecule comprising a nucleotide 
sequence which is at least 95 % identical to a sequence of at least about 500 
contiguous nucleotides in the nucleotide sequence of SEQ ID NO:X. 

25 A further preferred embodiment is a nucleic acid molecule comprising a 

nucleotide sequence which is at .least 95% identical to the nucleotide sequence of SEQ 
ID NO:X beginning with the nucleotide at about the position of the 5' Nucleotide of 

the First Amino Acid-of-the-Signal~Peptide"and"en3ing" withlhelmcFeoUde at about 

the position of the 3' Nucleotide of the Clone Sequence as defined for SEQ ID NO:X 

30 in Table 1. 
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A further preferred embodiment is an isolated nucleic acid molecule 
comprising a nucleotide sequence which is at least 95% identical to the complete 
nucleotide sequence of SEQ ID NO:X. 

Also preferred is an isolated nucleic acid molecule which. hybridizes under 
5 stringent hybridization conditions to a nucleic acid molecule, wherein said nucleic 
acid molecule which hybridizes does not hybridize under stringent hybridization 
conditions to a nucleic acid molecule having a nucleotide sequence consisting of only 
A residues or of only T residues. 

Also preferred is a composition of matter comprising a DNA molecule which 
10 comprises a human cDNA clone identified by a cDNA Clone Identifier in Table 1, 
which DNA molecule is contained in the material deposited with the American Type 
Culture Collection and given the ATCC Deposit Number shown in Table 1 for said 
cDNA Clone Identifier. 

Also preferred is an isolated nucleic acid molecule comprising a nucleotide 
15 sequence which is at least 95% identical to a sequence of at least 50 contiguous 

nucleotides in the nucleotide sequence of a human cDNA clone identified by a cDNA 
Clone Identifier in Table 1 , which DNA molecule is contained in the deposit given the 
ATCC Deposit Number shown in Table 1 . 

Also preferred is an isolated nucleic acid molecule, wherein said sequence of 
20 at least 50 contiguous nucleotides is included in the nucleotide sequence of the 
complete open reading frame sequence encoded by said human cDNA clone. 

Also preferred is an isolated nucleic acid molecule comprising a nucleotide 
sequence which is at least 95% identical to sequence of at least 150 contiguous 
nucleotides in the nucleotide sequence encoded by said human cDNA clone. 
25 A further preferred embodiment is an isolated nucleic acid molecule 

comprising a nucleotide sequence which is at least 95% identical to sequence of at 
least 500 contiguous nucleotides in the nucleotide sequence encoded by said human 

cDNA.clone 

A further preferred embodiment is an isolated nucleic acid molecule 
30 comprising a nucleotide sequence which is at least 95% identical to the complete 
nucleotide sequence encoded by said human cDNA clone. 
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A further preferred embodiment is a method for detecting in a biological 
sample a nucleic acid molecule comprising a nucleotide sequence which is at least 
95% identical to a sequence of at least 50 contiguous nucleotides in a sequence 
selected from the group consisting of: a nucleotide sequence of SEQ ID NO:X 
5 wherein X is any integer as defined in Table 1 ; and a nucleotide sequence encoded by 
a human cDNA clone identified by a cDNA Clone Identifier in Table 1 and contained 
in the deposit with the ATCC Deposit Number shown for said cDNA clone in Table 
1 ; which method comprises a step of comparing a nucleotide sequence of at least one 
nucleic acid molecule in said sample with a sequence selected from said group and 

10 determining whether the sequence of said nucleic acid molecule in said sample is at 
least 95% identical to said selected sequence. 

Also preferred is the above method wherein said step of comparing sequences 
comprises determining the extent of nucleic acid hybridization between nucleic acid 
molecules in said sample and a nucleic acid molecule comprising said sequence 

15 selected from said group. Similarly, also preferred is the above method wherein said 
step of comparing sequences is performed by comparing the nucleotide sequence 
determined from a nucleic acid molecule in said sample with said sequence selected 
from said group. The nucleic acid molecules can comprise DNA molecules or RNA 
molecules. 

20 A further preferred embodiment is a method for identifying the species, tissue 

or cell type of a biological sample which method comprises a step of detecting nucleic 
acid molecules in said sample, if any, comprising a nucleotide sequence that is at least 
95% identical to a sequence of at least 50 contiguous nucleotides in a sequence 
selected from the group consisting of: a nucleotide sequence of SEQ ID NO:X 

25 wherein X is any integer as defined in Table 1 ; and a nucleotide sequence encoded by 
a human cDNA clone identified by a cDNA Clone Identifier in Table 1 and contained 
in the deposit with the ATCC Deposit Number shown for said cDNA clone in Table 

The method for identifying the species, tissue or cell type of a biological 
30 sample can comprise a step of detecting nucleic acid molecules comprising a 

nucleotide sequence in a panel of at least two nucleotide sequences, wherein at least 
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one sequence in said panel is at least 95% identical to a sequence of at least 50 

contiguous nucleotides in a sequence selected from said group. 

Also preferred is a method for diagnosing in a subject a pathological condition 

associated with abnormal structure or expression of a gene encoding a secreted 
5 protein identified in Table 1 , which method comprises a step of detecting in a 

biological sample obtained from said subject nucleic acid molecules, if any, 

comprising a nucleotide sequence that is at least 95% identical to a sequence of at 

least 50 contiguous nucleotides in a sequence selected from the group consisting of: a 

nucleotide sequence of SEQ ID NO:X wherein X is any integer as defined in Table 1; 
10 and a nucleotide sequence encoded by a human cDNA clone identified by a cDNA 

Clone Identifier in Table 1 and contained in the deposit with the ATCC Deposit 

Number shown for said cDNA clone in Table 1 . 

The method for diagnosing a pathological condition can comprise a step of 

detecting nucleic acid molecules comprising a nucleotide sequence in a panel of at 
15 least two nucleotidesequences, wherein at least one sequence in said panel is at least 

95% identical to a sequence of at least 50 contiguous nucleotides in a sequence 

selected from said group. 

Also preferred is a composition of matter comprising isolated nucleic acid 

molecules wherein the nucleotide sequences of said nucleic acid molecules comprise 
20 a panel of at least two nucleotide sequences, wherein at least one sequence in said 

panel is at least 95% identical to a sequence of at least 50 contiguous nucleotides in a 

sequence selected from the group consisting of: a nucleotide sequence of SEQ ID 

NO:X wherein X is any integer as defined in Table 1 ; and a nucleotide sequence 

encoded by a human cDNA clone identified by a cDNA Clone Identifier in Table 1 
25 and contained in the deposit with the ATCC Deposit Number shown for said cDNA 

clone in Table 1 . The nucleic acid molecules can comprise DN A molecules or RNA 

molecules. 

Also_pxeferred is ^isolated polypeptide comprising" ail amino~acid sequence 

at least 90% identical to a sequence of at least about 10 contiguous amino acids in the 
30 amino acid sequence of SEQ ID NO:Y wherein Y is any integer as defined in Table 1. 
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Also preferred is a polypeptide, wherein said sequence of contiguous amino 
acids is included in the amino acid sequence of SEQ ID NO: Y in the range of 
positions beginning with the residue at about the position of the First Amino Acid of 
the Secreted Portion and ending with the residue at about the Last Amino Acid of the 
5 Open Reading Frame as set forth for SEQ ID NO:Y in Table 1. 

Also preferred is an isolated polypeptide comprising an amino acid sequence 
at least 95% identical to a sequence of at least about 30 contiguous amino acids in the 
amino acid sequence of SEQ ID NO:Y. 

Further preferred is an isolated polypeptide comprising an amino acid 
10 sequence at least 95% identical to a sequence of at least about 100 contiguous amino 
acids in the amino acid sequence of SEQ ID NO: Y. 

Further preferred is an isolated polypeptide comprising an amino acid 
sequence at least 95% identical to the complete amino acid sequence of SEQ ID 
NO.Y. 

15 Further preferred is an isolated polypeptide comprising ah amino acid 

sequence at least 90% identical to a sequence of at least about 10 contiguous amino 
acids in the complete amino acid sequence of a secreted protein encoded by a human 
cDNA clone identified by a cDNA Clone Identifier in Table 1 and contained in the 
deposit with the ATCC Deposit Number shown for said cDNA clone in Table 1. 

20 Also preferred is a polypeptide wherein said sequence of contiguous amino 

acids is included in the amino acid sequence of a secreted portion of the secreted 
protein encoded by a human cDNA clone identified by a cDN A Clone Identifier in 
Table 1 and contained in the deposit with the ATCC Deposit Number shown for said 
cDNA clone in Table 1 . 

25 Also preferred is an isolated polypeptide comprising an amino acid sequence 

at least 95% identical to a sequence of at least about 30 contiguous amino acids in the 
amino acid sequence of the secreted portion of the protein encoded by a human cDNA 
_clone id^ntified_by_a_cDNA-Clone-Identifier in-T-able-l and contained in"ttre"deposit" " 
with the ATCC Deposit Number shown, for said cDNA clone in Table 1. 

30 Also preferred is an isolated polypeptide comprising an amino acid sequence 

at least 95% identical to a sequence of at least about 100 contiguous amino acids in 
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the amino acid sequence of the secreted portion of the protein encoded by a human 
cDNA clone identified by a cDNA Clone Identifier in Table 1 and contained in the 
deposit with the ATCC Deposit Number shown for said cDNA clone in Table 1. 

Also preferred is an isolated polypeptide comprising an amino acid sequence 
at least 95% identical to the amino acid sequence of the secreted portion of the protein 
encoded by a human cDNA clone identified by a cDNA Clone Identifier in Table 1 
and contained in the deposit with the ATCC Deposit Number shown for said cDNA 
clone in Table 1. 

Further preferred is an isolated antibody which binds specifically to a 
polypeptide comprising an amino acid sequence that is at least 90% identical to a 
sequence of at least 10 contiguous amino acids in a sequence selected from the group 
consisting of: an amino acid sequence of SEQ ID NO: Y wherein Y is any integer as 
defined in Table 1 ; and a complete amino acid sequence of a protein encoded by a 
human cDNA clone identified by a cDNA Clone Identifier in Table 1 and contained 
in the deposit with "the ATCC Deposit Number shown for said cDNA clone in Table 
1. 

Further preferred is a method for detecting in a biological sample a 
polypeptide comprising an amino acid sequence which is at least 90% identical to a 
sequence of at least 10 contiguous amino acids in a sequence selected from the group 
consisting of: an amino acid sequence of SEQ ID NO:Y wherein Y is any integer as 
defined in Table 1 ; and a complete amino acid sequence of a protein encoded by a 
human cDNA clone identified by a cDNA Clone Identifier in Table 1 and contained 
in the deposit with the ATCC Deposit Number shown for said cDNA clone in Table 
1; which method comprises a step of comparing an amino acid sequence of at least 
one polypeptide molecule in said sample with a sequence selected from said group 
and determining whether the sequence of said polypeptide molecule in said sample is 
at least 90% identical to said sequence of at least 10 contiguous amino acids. 

Also preferred-is-the above method whereinsaidstep of comparihgan amino ~~ 

acid sequence of at least one polypeptide molecule in said sample with a sequence 
selected from said group comprises determining the extent of specific binding of 
polypeptides in said sample to an antibody which binds specifically to a polypeptide 
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comprising an amino acid sequence that is at least 90% identical to a sequence of at 
least 10 contiguous amino acids in a sequence selected from the group consisting of: 
an amino acid sequence of SEQ ID NO: Y wherein Y is any integer as defined in 
Table 1 ; and a complete amino acid sequence of a protein encoded by a human cDNA 
5 clone identified by a cDNA Clone Identifier in Table 1 and contained in the deposit 
with the ATCC Deposit Number shown for said cDNA clone in Table 1 . 

Also preferred is the above method wherein said step of comparing sequences 
is performed by comparing the amino acid sequence determined from a polypeptide 
molecule in said sample with said sequence selected from said group. 

10 Also preferred is a method for identifying the species, tissue or cell type of a 

biological sample which method comprises a step of detecting polypeptide molecules 
in said sample, if any, comprising an amino acid sequence that is at least 90% 
identical to a sequence of at least 10 contiguous amino acids in a sequence selected 
from the group consisting of: an amino acid sequence of SEQ ID NO: Y wherein Y is 

15 any integer as defined in Table 1; and a complete amino acid sequence of a secreted 
protein encoded by a human cDNA clone identified by a cDNA Clone Identifier in 
Table 1 and contained in the deposit with the ATCC Deposit Number shown for said 
cDNA clone in Table 1 . 

Also preferred is the above method for identifying the species, tissue or cell 

20 type of a biological sample, which method comprises a step of detecting polypeptide 
molecules comprising an amino acid sequence in a panel of at least two amino acid 
sequences, wherein at least one sequence in said panel is at least 90% identical to a 
sequence of at least 10 contiguous amino acids in a sequence selected from the above 
group. 

25 Also preferred is a method for diagnosing in a subject a pathological condition 

associated with abnormal structure or expression of a gene encoding a secreted 
protein identified in Table 1, which method comprises a step of detecting in a 

biologicd sample obtained-from-said subject polypeptide mole^ an 

amino acid sequence in a panel of at least two amino acid sequences, wherein at least 

30 one sequence in said panel is at least 90% identical to a sequence of at least 10 

contiguous amino acids in a sequence selected from the group consisting of: an amino 
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acid sequence of SEQ ID NO:Y wherein Y is any integer as defined in Table 1; and a 
complete amino acid sequence of a secreted protein encoded by a human cDNA clone 
identified by a cDNA Clone Identifier in Table 1 and contained in the deposit with the 
ATCC Deposit Number shown for said.cDNA clone in Table 1. 
5 In any of these methods, the step of detecting said polypeptide molecules 

includes using an antibody. 

Also preferred is an isolated nucleic acid molecule comprising a nucleotide 
sequence which is at least 95% identical to a nucleotide sequence encoding a 
polypeptide wherein said polypeptide comprises an amino acid sequence that is at 

10 least 90% identical to a sequence of at least 10 contiguous amino acids in a sequence 
selected from the group consisting of: an amino acid sequence of SEQ ID NO:Y 
wherein Y is any integer as defined in Table 1 ; and a complete amino acid sequence 
of a secreted protein encoded by a human cDNA clone identified by a cDNA Clone 
Identifier in Table 1 and contained in the deposit with the ATCC Deposit Number 

15 shown for said cDNA clone in Table lr 

Also preferred is an isolated nucleic acid molecule, wherein said nucleotide 
sequence encoding a polypeptide has been optimized for expression of said 
polypeptide in a prokaryotic host. 

Also preferred is an isolated nucleic acid molecule, wherein said polypeptide 

20 comprises an amino acid sequence selected from the group consisting of: an amino 
acid sequence of SEQ ID NO:Y wherein Y is any integer as defined in Table 1; and a 
complete amino acid sequence of a secreted protein encoded by a human cDNA clone 
identified by a cDNA Clone Identifier in Table 1 and contained in the deposit with the 
ATCC Deposit Number shown for said cDNA clone in Table 1 . 

25 Further preferred is a method of making a recombinant vector comprising 

inserting any of the above isolated nucleic acid molecule into a vector. Also preferred 
is the recombinant vector produced by this method. Also preferred is a method of 

making a recombinant Jiost celLcomprising-introdueing-the vector intoahost cell; as~ 

well as the recombinant host cell produced by this method. 

30 Also preferred is a method of making an isolated polypeptide comprising 

culturing this recombinant host cell under conditions such that said polypeptide is 
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expressed and recovering said polypeptide. Also preferred is this method of making 
an isolated polypeptide, wherein said recombinant host cell is a eukaryotic cell and 
said polypeptide is a secreted portion of a human secreted protein comprising an 
amino acid sequence selected from the group consisting of: an amino acid sequence of 
5 SEQ ID NO: Y beginning with the residue at the position of the First Amino Acid of 
the Secreted Portion of SEQ ID NO: Y wherein Y is an integer set forth in Table 1 and 
said position of the First Amino Acid of the Secreted Portion of SEQ ID NO: Y is 
defined in Table 1 ; and an amino acid sequence of a secreted portion of a protein 
encoded by a human cDNA clone identified by a cDNA Clone Identifier in Table 1 

10 and contained in the deposit with the ATCC Deposit Number shown for said cDNA 
clone in Table 1 . The isolated polypeptide produced by this method is also preferred. 

Also preferred is a method of treatment of an individual in need of an 
increased level of a secreted protein activity, which method comprises administering 
to such an individual a pharmaceutical composition comprising an amount of an 

15 isolated polypeptide^ polynucleotide, or antibody of the claimed invention effective to 
increase the level of said protein activity in said individual. 

Having generally described the invention, the same will be more readily 
understood by reference to the following examples, which are provided by way of 
illustration and are not intended as limiting. 

20 

Examples 

Example 1: Isolation of a Selected cDNA Clone From the Deposited Sample 

Each cDNA clone in a cited ATCC deposit is contained in a plasmid vector. 
25 Table 1 identifies the vectors used to construct the cDNA library from which each 
clone was isolated. In many cases, the vector used to construct the library is a phage 
vector from which a plasmid has been excised. The table immediately below 
corre late^ejglat^plasm^^ 

library. For example, where a particular clone is identified in Table 1 as being 
30 isolated in the vector "Lambda Zap," the corresponding deposited clone is in 
"pBluescript." 
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Vector Used to Construct Library 



Corresponding Deposited 



Plasmid 



Lambda Zap 
Uni-Zap XR 
Zap Express 



pBluescript (pBS) 
pBluescript (pBS) 
pBK 

plafmid B A 
pSportl 

pCMVSport 2.0 
pCMVSport 3.0 
pCR®2.1 



lafmid BA 



pSportl 

pCMVSport 2.0 
pCMVSport 3.0 
pCR®2.1 



Vectors Lambda Zap (U.S. Patent Nos. 5,128,256 and 5,286,636), Uni-Zap 
XR (U.S. Patent Nos. 5,128, 256 and 5,286,636), Zap Express (U.S. Patent Nos. 
5,128,256 and 5,286,636), pBluescript (pBS) (Short, J. M. et al., Nucleic Acids Res. 
16:7583-7600 (1988); Alting-Mees, M. A. and Short, J. M., Nucleic Acids Res. 
17:9494 (1989)) and pBK (Alting-Mees, M. A: et al., Strategies 5:58-61 (1992)) are 
commercially available from Stratagene Cloning Systems, Inc., 1 101 1 N. Torrey 
Pines Road, La Jolla, CA, 92037. pBS contains an ampicillin resistance gene and 
pBK contains a neomycin resistance gene. Both can be transformed into E. coli strain 
XL-1 Blue, also available from Stratagene. pBS comes in 4 forms SK+, SK-, KS+ 
and KS. The S and K refers to the orientation of the polylinker to the T7 and T3 
primer sequences which flank the polylinker region ("S" is for SacI and "K" is for 
Kpnl which are the first sites on each respective end of the linker). or "-" refer to 
the orientation of the fl origin of replication, ("ori'^^such-that in one orientation,- - 
single stranded rescue initiated from the f 1 ori generates sense strand DNA and in the 
other, antisense. 

Vectors pSportl, pCMVSport 2.0 and pCMVSport 3.0, were obtained from 
Life Technologies, Inc., P. O. Box 6009, Gaithersburg, MD 20897. All Sport vectors 
^mmi^^^npicillin_resistance_gene-and^m 

DH10B, also available from Life Technologies. (See, for instance, Gruber, C. E., et 
al., Focus 15:59 (1993).) Vector lafmid BA (Bento Soares, Columbia University, 
NY) contains an ampicillin resistance gene and can be transformed into E. coli strain 
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XL-1 Blue. Vector pCR®2.1, which is available from Invitrogen, 1600 Faraday 
Avenue, Carlsbad, CA 92008, contains an ampicillin resistance gene and may be 
transformed into E. coli strain DH10B, available from Life Technologies. (See, for 
instance, Clark, J. M., Nuc. Acids Res. 16:9677-9686 (1988) and Mead, D. et al., 
Bio/Technology 9: (1991).) Preferably, a polynucleotide of the present invention 
does not comprise the phage vector sequences identified for the particular clone in 
Table 1, as well as the corresponding plasmid vector sequences designated above. 

The deposited material in the sample assigned the ATCC Deposit Number 
cited in Table 1 for any given cDNA clone also may contain one or more additional 
plasmids, each comprising a cDNA clone different from that given clone. Thus, 
deposits sharing the same ATCC Deposit Number contain at least a plasmid for each 
cDNA clone identified in Table 1. Typically, each ATCC deposit sample cited in 
Table 1 comprises a mixture of approximately equal amounts (by weight) of about 50 
plasmid DNAs, each containing a different cDNA clone; but such a deposit sample 
may include plasmids for more or less than 50 cDNA clones; up to about 500 cDNA 
clones. 

Two approaches can be used to isolate a particular clone from the deposited 
sample of plasmid DNAs cited for that clone in Table 1 . First, a plasmid is directly 
isolated by screening the clones using a polynucleotide probe corresponding to SEQ 
IDNO:X. 

Particularly, a specific polynucleotide with 30-40 nucleotides is synthesized 
using an Applied Biosystems DNA synthesizer according to the sequence reported. 
The oligonucleotide is labeled, for instance, with - 2 P-y- ATP using T4 polynucleotide 
kinase and purified according to routine methods. (E.g., Maniatis et al., Molecular 
Cloning: A Laboratory Manual, Cold Spring Harbor Press, Cold Spring, NY (1982).) 
The plasmid mixture is transformed into a suitable host, as indicated above (such as 
XL-1 Blue (Stratagene)) using techniques known to those of skill in the art, such as 
Jhose_provided-by-the-vector-supplier or in related"publicafiohs~or patenfsliited above. 
The transformants are plated on 1.5% agar plates (containing the appropriate selection 
agent, e.g., ampicillin) to a density of about 150 transformants (colonies) per plate. 
These plates are screened using Nylon membranes according to routine methods for 
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bacterial colony screening (e.g., Sambrook et ah, Molecular Cloning: A Laboratory 
Manual, 2nd Edit., (1989), Cold Spring Harbor Laboratory Press, pages 1.93 to 
1 .104), or other techniques known to those of skill in the art. 

Alternatively, two primers of 17-20 nucleotides derived from both ends of the 
SEQ ID NO:X (i.e., within the region of SEQ ID NO:X bounded by the 5' NT and the 
3* NT of the clone defined in Table 1) are synthesized and used to amplify the desired 
cDNA using the deposited cDNA plasmid as a template. The polymerase chain 
reaction is carried out under routine conditions, for instance, in 25 pi of reaction 
mixture with 0.5 ug of the above cDNA template. A convenient reaction mixture is 
1.5-5 rnM MgCl 2 , 0.01% (w/v) gelatin, 20 fiM each of dATP, dCTP, dGTP, dTTP, 25 
pmol of each primer and 0.25 Unit of Taq polymerase. Thirty five cycles of PCR 
(denaturation at 94°C for 1 min; annealing at 55°C for 1 min; elongation at 72°C for 1 
min) are performed with a Perkin-Elmer Cetus automated thermal cycler. The 
amplified product is analyzed by agarose gel electrophoresis and the DNA band with 
expected molecular Aveight is excised and purified. The PCR product is verified to be 
the selected sequence by subcloning and sequencing the DNA product. 

Several methods are available for the identification of the 5' or 3' non-coding 
portions of a gene which may not be present in the deposited clone. These methods 
include but are not limited to, filter probing, clone enrichment using specific probes, 
and protocols similar or identical to 5' and 3' "RACE" protocols which are well 
known in the art. For instance, a method similar to 5* RACE is available for 
generating the missing 5* end of a desired full-length transcript. (Fromont-Racine et 

al., Nucleic Acids Res. 21(7)jl683 r 1684 (1993).) . - - - - 

Briefly, a specific RNA oligonucleotide is ligated to the 5' ends of a 
population of RNA presumably containing full-length gene RNA transcripts. A 
primer set containing a primer specific to the ligated RNA oligonucleotide and a 
primer specific to a known sequence of the gene of interest is used to PCR amplify 

sequenced and used to generate the full length gene. 

This above method starts with total RNA isolated from the desired source, 
although poly-A+ RNA can be used. The RNA preparation can then be treated with 
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phosphatase if necessary to eliminate 5* phosphate groups on degraded or damaged 
RNA which may interfere with the later RNA ligase step. The phosphatase should 
then be inactivated and the RNA treated with tobacco acid pyrophosphatase in order 
to remove the cap structure present at the 5* ends of messenger RNAs. This reaction 
5 leaves a 5' phosphate group at the 5* end of the cap cleaved RNA which can then be 
ligated to an RNA oligonucleotide using T4 RNA ligase. 

This modified RNA preparation is used as a template for first strand cDNA 
synthesis using a gene specific oligonucleotide. The first strand synthesis reaction is 
used as a template for PCR amplification of the desired 5' end using a primer specific 
10 to the ligated RNA oligonucleotide and a primer specific to the known sequence of 
the gene of interest. The resultant product is then sequenced and analyzed to confirm 
that the 5* end sequence belongs to the desired gene. 

Example 2: Isolation of Genomic Clones Corresponding to a Polynucleotide 

15 A human genomic PI library (Genomic Systems, Inc.) is screened by PCR 

using primers selected for the cDNA sequence corresponding to SEQ ID NO:X., 
according to the method described in Example 1. (See also, Sambrook.) 

Example 3: Tissue Distribution of Polypeptide 

20 Tissue distribution of mRNA expression of polynucleotides of the present 

invention is determined using protocols for Northern blot analysis, described by, 
among others, Sambrook et al. For example, a cDNA probe produced by the method 
described in Example 1 is labeled with P 32 using the rediprime™ DNA labeling " 
system (Amersham Life Science), according to manufacturer's instructions. After 

25 labeling, the probe is purified using CHROMA SPIN- 100™ column (Clontech 
Laboratories, Inc.), according to manufacturer's protocol number PT 1200-1. The 
purified labeled probe is then used to examine various human tissues for mRNA 

expression. — ~~ 

Multiple Tissue Northern (MTN) blots containing various human tissues (H) 

30 or human immune system tissues (IM) (Clontech) are examined with the labeled 
probe using ExpressHyb™ hybridization solution (Clontech) according to 



WO 99/47540 



PCT/US99/05804 



237 

manufacturer's protocol number PT1 190-1. Following hybridization and washing, the 
blots are mounted and exposed to film at -70°C overnight, and the films developed 
according to standard procedures. 

Example 4: Chromosomal Mapping of the Polynucleotides 

An oligonucleotide primer set is designed according to the sequence at the 5' 
end of SEQ ID NO:X. This primer preferably spans about 100 nucleotides. This 
primer set is then used in a polymerase chain reaction under the following set of 
conditions : 30 seconds, 95°C; 1 minute, 56°C; 1 minute, 70°C. This cycle is 
repeated 32 times followed by one 5 minute cycle at 70°C. Human, mouse, and 
hamster DNA is used as template in addition to a somatic cell hybrid panel containing 
individual chromosomes or chromosome fragments (Bios, Inc). The reactions is 
analyzed on either 8% polyacrylamide gels or 3.5 % agarose gels. Chromosome 
mapping is determined by the presence of an approximately 100 bp PCR fragment in 
the particular somatic cell hybrid. 

Example 5: Bacterial Expression of a Polypeptide 

A polynucleotide encoding a polypeptide of the present invention is amplified 
using PCR oligonucleotide primers corresponding to the 5' and 3' ends of the DNA 
sequence, as outlined in Example 1, to synthesize insertion fragments. The primers 
used to amplify the cDNA insert should preferably contain restriction sites, such as 
BamHI and Xbal, at the 5' end of the primers in order to clone the amplified product 
into the expression vector. For example, BamHI and Xbal correspond to the 
restriction enzyme sites on the bacterial expression vector pQE-9. (Qiagen, Inc., 
Chatsworth, CA). This plasmid vector encodes antibiotic resistance (Amp 1 ), a 
bacterial origin of replication (ori), an IPTG-regulatable promoter/operator (P/O), a 
ribosome binding site (RBS), a 6-histidine tag (6-His), and restriction enzyme cloning 
sites . — — " 

The pQE-9 vector is digested with BamHI and Xbal and the amplified 
fragment is ligated into the pQE-9 vector maintaining the reading frame initiated at 
the bacterial RBS. The ligation mixture is then used to transform the E. coli strain 
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M15/rep4 (Qiagen, Inc.) which contains multiple copies of the plasmid pREP4, which 

expresses the lad repressor and also confers kanamycin resistance (Kan r ). 
Transformants are identified by their ability to grow on LB plates and 
ampicillin/kanamycin resistant colonies are selected. Plasmid DNA is isolated and 
5 confirmed by restriction analysis. 

Clones containing the desired constructs are grown overnight (O/N) in liquid 
culture in LB media supplemented with both Amp (100 ug/ml) and Kan (25 ug/ml). 
The O/N culture is used to inoculate a large culture at a ratio of 1:100 to 1:250. The 
cells are grown to an optical density 600 (O.D. 600 ) of between 0.4 and 0.6. IPTG 

10 (Isopropyl-B-D-thiogalacto pyranoside) is then added to a final concentration of 1 
mM. IPTG induces by inactivating the lad repressor, clearing the P/O leading to 
increased gene expression. 

Cells are grown for an extra 3 to 4 hours. Cells are then harvested by 
centrifugation (20 mins at 6000Xg). The cell pellet is solubilized in the chaotropic 

15 agent 6 Molar Guanidine HC1 by stirring for 3^4 hours at 4°C. The cell debris is 

removed by centrifugation, and the supernatant containing the polypeptide is loaded 
onto a nickel-nitrilo-tri-acetic acid ("Ni-NTA") affinity resin column (available from 
QIAGEN, Inc., supra). Proteins with a 6 x His tag bind to the Ni-NTA resin with 
high affinity and can be purified in a simple one-step procedure (for details see: The 

20 QIAexpressionist (1995) QIAGEN, Inc., supra). 

Briefly, the supernatant is loaded onto the column in 6 M guanidine-HCl, pH 
8, the column is first washed with 10 volumes of 6 M guanidine-HCl, pH 8, then 
washed with 10 volumes of 6 M guanidine-HCl pH 6, and finally the polypeptide 's " 
eluted with 6 M guanidine-HCl, pH 5. 

25 The purified protein is then renatured by dialyzing it against phosphate- 

buffered saline (PBS) or 50 mM Na-acetate, pH 6 buffer plus 200 mM NaCL 
Alternatively, the protein can be successfully refolded while immobilized on the Ni- 
NTA column. The recommer^d^nditions-are as-follows: renature^sing~a'Iinear 

_ 6M-1M urea gradient in 500 mM NaCl, 20% glycerol, 20 mM Tris/HCl pH 7.4, 

30 containing protease inhibitors. The renaturation should be performed over a period of 
1.5 hours or more. After renaturation the proteins are eluted by the addition of 250 
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mM immidazole. Immidazole is removed by a final dialyzing step against PBS or 50 
mM sodium acetate pH 6 buffer plus 200 mM NaCl. The purified protein is stored at 
4°C or frozen at -80° C. 

In addition to the above expression vector, the present invention further 
5 includes an expression vector comprising phage operator and promoter elements 

operatively linked to a polynucleotide of the present invention, called pHE4a. (ATCC 
Accession Number 209645, deposited on February 25, 1998.) This vector contains: 
1) a neomy ^phosphotransferase gene as a selection marker, 2) an E. coli origin of 
replication, 3) a T5 phage promoter sequence, 4) two lac operator sequences, 5) a 

10 . Shine-Delgarno sequence, and 6) the lactose operon repressor gene (laclq). The 
origin of replication (oriC) is derived from pUC19 (LTI, Gaithersburg, MD). The 
promoter sequence and operator sequences are made synthetically. 

DNA can be inserted into the pHEa by restricting the vector with Ndel and 
Xbal, BamHI, Xhol, or Asp718, running the restricted product on a gel, and isolating 

15 the larger fragment {the stuffer fragment should be about 310 base pairs). The DN A 
insert is generated according to the PCR protocol described in Example 1, using PCR 
primers having restriction sites for Ndel (5' primer) and Xbal, BamHI, Xhol, or 
Asp718 (3' primer). The PCR insert is gel purified and restricted with compatible 
enzymes. The insert and vector are ligated according to standard protocols. 

20 The engineered vector could easily be substituted in the above protocol to 

express protein in a bacterial system. 

Example 6: Purification of a Polypeptide from an Inclusion Body 

The following alternative method can be used to purify a polypeptide 
25 expressed in E coli when it is present in the form of inclusion bodies. Unless 
otherwise specified, all of the following steps are conducted at 4-10°C. 

Upon completion of the production phase of the E. coli fermentation, the cell 

culture is cooled to 4-10°C and the cells Jiarvested_byjcontinuous centrifugationat 

15,000 rpm (Heraeus Sepatech). On the basis of the expected yield of protein per unit 
30 weight of cell paste and the amount of purified protein required, an appropriate 

amount of cell paste, by weight, is suspended in a buffer solution containing 100 mM 
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Tris, 50 mM EDTA, pH 7.4. The cells are dispersed to a homogeneous suspension 
using a high shear mixer. 

The cells are then lysed by passing the solution through a microfluidizer 
(Microfuidics, Corp. or APV Gaulin, Inc.) twice at 4000-6000 psi. The homogenate 
5 is then mixed with NaCl solution to a final concentration of 0.5 M NaCl, followed by 
centrifugation at 7000 xg for 15 min. The resultant pellet is washed again using 0.5M 
NaCl, 100 mM Tris, 50 mM EDTA, pH 7.4. 

The resulting washed inclusion bodies are solubilized with 1.5 M guanidine 
hydrochloride (GuHCl) for 2-4 hours. After 7000 xg centrifugation for 15 min., the 
10 pellet is discarded and the polypeptide containing supernatant is incubated at 4°C 
overnight to allow further GuHCl extraction. 

Following high speed centrifugation (30,000 xg) to remove insoluble particles, 
the GuHCl solubilized protein is refolded by quickly mixing the GuHCl extract with 
20 volumes of buffer containing 50 mM sodium, pH 4.5, 150 mM NaCl, 2 mM EDTA 
15 by vigorous stirring; The refolded diluted protein solution is kept at 4°C without 
mixing for 12 hours prior to further purification steps. 

To clarify the refolded polypeptide solution, a previously prepared tangential 
filtration unit equipped with 0.16 Jim membrane filter with appropriate surface area 
(e.g., Filtron), equilibrated with 40 mM sodium acetate, pH 6.0 is employed. The 
20 filtered sample is loaded onto a cation exchange resin (e.g., Poros HS-50, Perseptive 
Biosystems). The column is washed with 40 mM sodium acetate, pH 6.0 and eluted 
with 250 mM, 500 mM, 1000 mM, and 1500 mM NaCl in the same buffer, in a 
stepwise manner. The absorbance at 280 nm of the effluent is continuously 
monitored. Fractions are collected and further analyzed by SDS-PAGE. 
25 Fractions containing the polypeptide are then pooled and mixed with 4 

volumes of water. The diluted sample is then loaded onto a previously prepared set of 
tandem columns of strong anion (Poros HQ-50, Perseptive Biosystems) and weak 
anion (Poros CM-20, Perseptive^(^ystems)„exchange resins^ The columnsare 
equilibrated with 40 mM sodium acetate, pH 6.0. Both columns are washed with 40 
30 mM sodium acetate, pH 6.0, 200 mM NaCl. The CM-20 column is then eluted using 
a 10 column volume linear gradient ranging from 0.2 M NaCl, 50 mM sodium 
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acetate, pH 6.0 to 1.0 M NaCl, 50 mM sodium acetate, pH 6.5. Fractions are 
collected under constant monitoring of the effluent. Fractions containing the 
polypeptide (determined, for instance, by 16% SDS-PAGE) are then pooled. 

The resultant polypeptide should exhibit greater than 95% purity after the 
above refolding and purification steps. No major contaminant bands should be 
observed from Commassie blue stained 16% SDS-PAGE gel when 5 (ig of purified 
protein is loaded. The purified protein can also be tested for endotoxin/LPS 
contamination, and typically the LPS content is less than 0. 1 ng/ml according to LAL 
assays. 

Example 7: Cloning and Expression of a Polypeptide in a Baculovirus 
Expression System 

In this example, the plasmid shuttle vector pA2 is used to insert a 
polynucleotide into a baculovirus to express a polypeptide. This expression vector 
contains the strong polyhedrin promoter of the Autographa californica nuclear 
polyhedrosis virus (AcMNPV) followed by convenient restriction sites such as 
BamHI, Xba I and Asp718. The polyadenylation site of the simian virus 40 ("SV40") 
is used for efficient polyadenylation. For easy selection of recombinant virus, the 
plasmid contains the beta-galactosidase gene from E. coli under control of a weak 
Drosophila promoter in the same orientation, followed by the polyadenylation signal 
of the polyhedrin gene. The inserted genes are flanked on both sides by viral 
sequences for cell-mediated homologous recombination with wild-type viral DNA to 
generate a viable virus that express-the cloned-polynucleotide. 

Many other baculovirus vectors can be used in place of the vector above, such 
as pAc373, pVL941, and pAcIMl, as one skilled in the art would readily appreciate, 
as long as the construct provides appropriately located signals for transcription, 
translation, secretion and the like, including a signal peptide and an in-frame AUG as 
reqmr^.Smcl^ 
39 (1989). 

Specifically, the cDNA sequence contained in the deposited clone, including 
the AUG initiation codon and the naturally associated leader sequence identified in 
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Table 1, is amplified using the PCR protocol described in Example 1. If the naturally 
occurring signal sequence is used to produce the secreted protein, the pA2 vector does 
not need a second signal peptide. Alternatively, the vector can be modified (pA2 GP) 
to include a baculovirus leader sequence, using the standard methods described in 
5 Summers et al., "A Manual of Methods for Baculovirus Vectors and Insect Cell 
Culture Procedures," Texas Agricultural Experimental Station Bulletin No. 1555 
(1987). 

The amplified fragment is isolated from a 1 % agarose gel using a 
commercially available kit ("Geneclean," BIO 101 Inc., La Jolla, Ca.). The fragment 
10 then is digested with appropriate restriction enzymes and again purified on a 1% 
agarose gel. 

The plasmid is digested with the corresponding restriction enzymes and 
optionally, can be dephosphorylated using calf intestinal phosphatase, using routine 
procedures known in the art. The DNA is then isolated from a 1% agarose gel using a 

15 commercially available kit ("Geneclean " BIO 101 Inc., La Jolla, Ca.). 

The fragment and the dephosphorylated plasmid are ligated together with T4 
DNA ligase. E. coli HB101 or other suitable E. coli hosts such as XL-1 Blue 
(Stratagene Cloning Systems, La Jolla, CA) cells are transformed with the ligation 
mixture and spread on culture plates. Bacteria containing the plasmid are identified 

20 by digesting DNA from individual colonies and analyzing the digestion product by 
gel electrophoresis. The sequence of the cloned fragment is confirmed by DNA 
sequencing. 

Five jig of a plasmid containing the polynucleotide is co-transfected with l .O 
|ig of a commercially available linearized baculovirus DNA ("BaculoGold™ 
25 baculovirus DNA", Pharmingen, San Diego, CA), using the lipofection method 

described by Feigner et al., Proc. Natl. Acad. Sci. USA 84:7413-7417 (1987). One ^g 
of BaculoGold™ virus DNA and 5 jig of the plasmid are mixed in a sterile well of a 
j™c£btteij)J^ 

Inc., Gaithersburg, MD). Afterwards, 10 |il Lipofectin plus 90 \x\ Grace's medium are 
30 added, mixed and incubated for 15 minutes at room temperature. Then the 

transfection mixture is added drop-wise to Sf9 insect cells (ATCC CRL 1711) seeded 
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in a 35 mm tissue culture plate with 1 ml Grace's medium without serum. The plate is 
then incubated for 5 hours at 27° C. The transfection solution is then removed from 
the plate and 1 ml of Grace's insect medium supplemented with 10% fetal calf serum 
is added. Cultivation is then continued at 27° C for four days. 
5 After four days the supernatant is collected and a plaque assay is performed, 

as described by Summers and Smith, supra. An agarose gel with "Blue Gal" (Life 
Technologies Inc., Gaithersburg) is used to allow easy identification and isolation of 
gal-expressing clones, which produce blue-stained plaques. (A detailed description of 
a "plaque assay" of this type can also be found in the user's guide for insect cell 

10 culture and baculovirology distributed by Life Technologies Inc., Gaithersburg, page 
9-10.) After appropriate incubation, blue stained plaques are picked with the tip of a 
micropipettor (e.g., Eppendorf). The agar containing the recombinant viruses is then 
resuspended in a microcentrifuge tube containing 200 \x\ of Grace's medium and the 
suspension containing the recombinant baculovirus is used to infect Sf9 cells seeded 

15 in 35 mm dishes. Four days later the supernatants of these culture dishes are 
harvested and then they are stored at 4° C. 

To verify the expression of the polypeptide, Sf9 cells are grown in Grace f s 
medium supplemented with 1 0% heat-inactivated FBS. The cells are infected with 
the recombinant baculovirus containing the polynucleotide at a multiplicity of 

20 infection ("MOI") of about 2. If radiolabeled proteins are desired, 6 hours later the 
medium is removed and is replaced with SF900 II medium minus methionine and 
cysteine (available from Life Technologies Inc., Rockville, MD). After 42 hours, 5 

|iCi of 35 S-methionine and 5 \id 35 S-cysteine (available from Amersham) are added. 

The cells are further incubated for 16 hours and then are harvested by centrifugation. 

25 The proteins in the supernatant as well as the intracellular proteins are analyzed by 
SDS-PAGE followed by autoradiography (if radiolabeled). 

Microsequencing of the amino acid sequence of the amino terminus of 

purified protein may be used to determine the amino terminaJ_^uenceofthe : 

produced protein . 

30 Example 8: Expression of a Polypeptide in Mammalian Cells 
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The polypeptide of the present invention can be expressed in a mammalian 
cell. A typical mammalian expression vector contains a promoter element, which 
mediates the initiation of transcription of mRNA, a protein coding sequence, and 
signals required for the termination of transcription and polyadenylation of the 
5 transcript. Additional elements include enhancers, Kozak sequences and intervening 
sequences flanked by donor and acceptor sites for RNA splicing. Highly efficient 
transcription is achieved with the early and late promoters from SV40, the long 
terminal repeats (LTRs) from Retroviruses, e.g., RSV, HTLVI, HIVI and the early 
promoter of the cytomegalovirus (CMV). However, cellular elements can also be 

10 used (e.g., the human actin promoter). 

Suitable expression vectors for use in practicing the present invention include, 
for example, vectors such as pSVL and pMSG (Pharmacia, Uppsala, Sweden), 
pRSVcat (ATCC 37152), pSV2dhfr (ATCC 37146), pBC12MI (ATCC 67109), 
pCMVSport 2.0 ? and pCMVSport 3.0. Mammalian host cells that could be used 

15 include, human Hela; 293, H9 and Jurkat cells, mouse NIH3T3 arid C127 cells, Cos 1, 
Cos 7 and CV1, quail QC1-3 cells, mouse L cells and Chinese hamster ovary (CHO) 
cells. 

Alternatively, the polypeptide can be expressed in stable cell lines containing 
the polynucleotide integrated into a chromosome. The co-transfection with a 

20 selectable marker such as dhfr, gpt,. neomycin, hygromycin allows the identification 
and isolation of the transfected cells. 

The transfected gene can also be amplified to express large amounts of the 
encoded protein. The DHFR (dihydrofolate reductase) marker is-useful in developing " " " 
cell lines that carry several hundred or even several thousand copies of the gene of 

25 interest. (See, e.g., Alt, F. W., et al., J. Biol. Chem. 253:1357-1370 (1978); Hamlin, J. 
L. and Ma, C, Biochem. et Biophys. Acta, 1097:107-143 (1990); Page, M. J. and 
Sydenham, M. A., Biotechnology 9:64-68 (1991).) Another useful selection marker 
is the enzym e gtotam^syi^ 

(1991); Bebbington et al., Bio/Technology 10:169-175 (1992). Using these markers, 
30 the mammalian cells are grown in selective medium and the cells with the highest 

resistance are selected. These cell lines contain the amplified gene(s) integrated into a 
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chromosome. Chinese hamster ovary (CHO) and NSO cells are often used for the 
production of proteins. 

Derivatives of the plasmid pSV2-dhfr (ATCC Accession No. 37146), the 
expression vectors pC4 (ATCC Accession No. 209646) and pC6 (ATCC Accession 
5 No.209647) contain the strong promoter (LTR) of the Rous Sarcoma Virus (Cullen et 
al., Molecular- and Cellular Biology r 438-447 (March, 1985)) plus a fragment of the 
CMV-enhancer (Boshart et al., Cell 41:521-530 (1985).) Multiple cloning sites, e.g., 
with the restriction enzyme cleavage sites BamHI, Xbal and Asp718, facilitate the 
cloning of the gene of interest. The vectors also contain the 3' intron, the 

10 polyadenylation and termination signal of the rat preproinsulin gene, and the mouse 
DHFR gene under control of the SV40 early promoter. 

Specifically, the plasmid pC6, for example, is digested with appropriate 
restriction enzymes and then dephosphorylated using calf intestinal phosphates by 
procedures known in the art. The vector is then isolated from a 1% agarose gel. 

15 A polynucleotide of the present invention is amplified according to the 

protocol outlined in Example 1. If the naturally occurring signal sequence is used to 
produce the secreted protein, the vector does not need a second signal peptide. 
Alternatively, if the naturally occurring signal sequence is not used, the vector can be 
modified to include a heterologous signal sequence. (See, e.g., WO 96/34891.) 

20 The amplified fragment is isolated from a 1% agarose gel using a 

commercially available kit ("Geneclean," BIO 101 Inc., La Jolla, Ca.). The fragment 
then is digested with appropriate restriction enzymes and again purified on a 1 % 

agarose gel. _ - - 

The amplified fragment is then digested with the same restriction enzyme and 

25 purified on a 1% agarose gel. The isolated fragment and the dephosphorylated vector 
are then ligated with T4 DNA ligase. E. coli HB 101 or XL-1 Blue cells are then 
transformed and bacteria are identified that contain the fragment inserted into plasmid 

pC6 using, for instance, restriction enzyn^jmaly^is^ - — 

Chinese hamster ovary cells lacking an active DHFR gene is used for 

30 . transfection. Five \ig of the expression plasmid pC6 is cotransfected with 0.5 ^ig of 
the plasmid pSVneo using lipofectin (Feigner et al., supra). The plasmid pSV2-neo 
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contains a dominant selectable marker, the neo gene from Tn5 encoding an enzyme 
that confers resistance to a group of antibiotics including G418. The cells are seeded 
in alpha minus MEM supplemented with 1 mg/ml G418. After 2 days, the cells are 
trypsinized and seeded in hybridoma cloning plates (Greiner, Germany) in alpha 
5 minus MEM supplemented with 10, 25, or 50 ng/ml of metothrexate plus 1 mg/ml 
G418. After about 10-14 days single clones are trypsinized and then seeded in 6-well 
petri dishes or 10 ml flasks using different concentrations of methotrexate (50 nM, 
100 nM, 200 nM, 400 nM, 800 nM). Clones growing at the highest concentrations of 
methotrexate are then transferred to new 6-well plates containing even higher 
10 concentrations of methotrexate (1 |iM, 2 jllM, 5 |xM, 10 mM, 20 mM). The same 

procedure is repeated until clones are obtained which grow at a concentration of 100 - 
200 ^iM. Expression of the desired gene product is analyzed, for instance, by SDS- 
PAGE and Western blot or by reversed phase HPLC analysis. 

15 Example 9: Protein-Fusions - 

The polypeptides of the present invention are preferably fused to other 
proteins. These fusion proteins can be used for a variety of applications. For 
example, fusion of the present polypeptides to His-tag, HA-tag, protein A, IgG 
domains, and maltose binding protein facilitates purification. (See Example 5; see 

20 alsoEP A 394,827; Traunecker, et al., Nature 331:84-86 (1988).) Similarly, fusion to 
IgG-1, IgG-3, and albumin increases the halflife time in vivo. Nuclear localization 
signals fused to the polypeptides of the present invention can target the protein to a 
specific subcellular localization, while covalent heterodimer or. homodimers can 
increase or decrease the activity of a fusion protein. Fusion proteins can also create 

25 chimeric molecules having more than one function. Finally, fusion proteins can 
increase solubility and/or stability of the fused protein compared to the non-fused 
protein. All of the types of fusion proteins described above can be made by 

modifying the following ^o^^,j^ictL^utlines_the.fusion of-a polypeptide to an 

IgG molecule, or the protocol described in Example 5. 

30 Briefly, the human Fc portion of the IgG molecule can be PCR amplified, 

using primers that span the 5' and 3' ends of the sequence described below. These 
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primers also should have convenient restriction enzyme sites that will facilitate 
cloning into an expression vector, preferably a mammalian expression vector. 

For example, if pC4 (Accession No. 209646) is used, the human Fc portion 
can be ligated into the BamHI cloning site. Note that the 3' BamHI site should be 
5 destroyed. Next, the vector containing the human Fc portion is re-restricted with 

BamHI, linearizing the vector, and a polynucleotide of the present invention, isolated 
by the PCR protocol described in Example 1, is ligated into this BamHI site. Note 
that the polynucleotide is cloned without a stop codon, otherwise a fusion protein will 
not be produced. 

10 If the naturally occurring signal sequence is used to produce the secreted 

protein, pC4 does not need a second signal peptide. Alternatively, if the naturally 
occurring signal sequence is not used, the vector can be modified to include a 
heterologous signal sequence. (See, e.g., WO 96/34891.) 

15 Human IgG Fc region: - - 

GGGATCCGGAGCCCAAATCTTCTGACAAAACTCACACATGCCCACCGTGC 
CCAGCACCTGAATTCGAGGGTGCACCGTCAGTCTTCCTCTTCCCCCCAAAA 
CCCAAGGACACCCTCATGATCTCCCGGACTCCTGAGGTCACATGCGTGGT 
GGTGGACGTAAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGG 

20 ACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTA 
CAACAGCACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACT 
GGCTGAATGGCAAGGAGTACAAGTGCAAGGTCTCCAACAAAGCCCTCCCA 

ACCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGCAGCeCCGAGAAC 

CACAGGTGTACACCCTGCCCCCATCCCGGGATGAGCTGACCAAGAACCAG 

25 GTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCAAGCGACATCGCCGT 
GGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCT 
CCCGTGCTGGACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTG 

GACAAGAGCAGGTGGCAGGAGGGGAAGGTCTTGTeATGCTCCGTGATGCA 

TGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCGG 

30 GTAA ATG AGTGCG ACGGCCGCG ACTCTAG AGG AT (SEQ ID NO: 1 ) 
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Example 10: Production of an Antibody from a Polypeptide 

The antibodies of the present invention can be prepared by a variety of 
methods. (See, Current Protocols, Chapter 2.) For example, cells expressing a 
polypeptide of the present invention is administered to an animal to induce the 
production of sera containing polyclonal antibodies. In a preferred method, a 
preparation of the secreted protein is prepared and purified to render it substantially 
free of natural contaminants. Such a preparation is then introduced into an animal in 
order to produce polyclonal antisera of greater specific activity. 

In the most preferred method, the antibodies of the present invention are 
monoclonal antibodies (or protein binding fragments thereof). Such monoclonal 
antibodies can be prepared using hybridoma technology. (Kohler et al., Nature 
256:495 (1975); Kohler et al., Eur. J. Immunol. 6:51 1 (1976); Kohler et al., Eur. J. 
Immunol. 6:292 (1976); Hammerling et al., in: Monoclonal Antibodies and T-Cell 
Hybridomas, Elsevier, N.Y., pp. 563-681 (1981).) In general, such procedures 
involve immunizing" an animal, (preferably a mouse) with polypeptide or, more 
preferably, with a secreted polypeptide-expressing cell. Such cells may be cultured in 
any suitable tissue culture medium; however, it is preferable to culture cells in Earle's 
modified Eagle's medium supplemented with 10% fetal bovine serum (inactivated at 
about 56°C), and supplemented with about 10 g/1 of nonessential amino acids, about 
1,000 U/ml of penicillin, and about 100 |Xg/ml of streptomycin. 

The splenocytes of such mice are extracted and fused with a suitable myeloma 
cell line. Any suitable myeloma cell line may be employed in accordance with the 
present invention; howeyer,.it is preferable to employ the parent myeloma cell line 
(SP20), available from the ATCC. After fusion, the resulting hybridoma cells are 
selectively maintained in HAT medium, and then cloned by limiting dilution as 
described by Wands et al. (Gastroenterology 80:225-232 (1981).) The hybridoma 
cells obtained through such a selection are then assayed to identify clones which 

^cret^antibodies.capable of-binding -the polypeptide: 

Alternatively, additional antibodies capable of binding to the polypeptide can 
be produced in a two-step procedure using anti-idiotypic antibodies. Such a method 
makes use of the fact that antibodies are themselves antigens, and therefore, it is 
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possible to obtain an antibody which binds to a second antibody. In accordance with 
this method, protein specific antibodies are used to immunize an animal, preferably a 
mouse. The splenocytes of such an animal are then used to produce hybridoma cells, 
and the hybridoma cells are screened to identify clones which produce an antibody 
5 whose ability to bind to the protein-specific antibody can be blocked by the 
polypeptide. Such antibodies comprise anti -idiotypic antibodies to the protein- 
specific antibody and can be used to immunize an animal to induce formation of 
further protein-specific antibodies. 

It will be appreciated that Fab and F(ab')2 and other fragments of the 

10 antibodies of the present invention may be used according to the methods disclosed 
herein. Such fragments are typically produced by proteolytic cleavage, using 
enzymes such as papain (to produce Fab fragments) or pepsin (to produce F(ab')2 
fragments). Alternatively, secreted protein-binding fragments can be produced 
through the application of recombinant DNA technology or through synthetic 

15 chemistry.. "~ . 

For in vivo use of antibodies in humans, it may be preferable to use 
"humanized" chimeric monoclonal antibodies. Such antibodies can be produced 
using genetic constructs derived from hybridoma cells producing the monoclonal 
antibodies described above. Methods for producing chimeric antibodies are known in 

20 the art. (See, for review, Morrison, Science 229:1202 (1985); Oi et al M 

BioTechniques 4:214 (1986); Cabilly et al., U.S. Patent No. 4,816,567; Taniguchi et 
al., EP 171496; Morrison et al., EP 173494; Neuberger et al., WO 8601533; Robinson 
et al., WO 8702671 ; Boulianne_et al.,„Nature 3 12:643 (-1984); Neuberger et al., Nature ~ 
314:268(1985).) 

25 

Example 11: Production Of Secreted Protein For High-Throughput Screening 
Assays 

Ttejo^wing.protocol-produces a supernatant containing a ^polypeptideTo'be 

tested. This supernatant can then be used in the Screening Assays described in 
30 Examples 13-20. 
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First, dilute Poly-D-Lysine (644 587 Boehringer-Mannheim) stock solution 
(lmg/ml in PBS) 1:20 in PBS (w/o calcium or magnesium 17-5 16F Biowhittaker) for 
a working solution of 50ug/ml. Add 200 ul of this solution to each well (24 well 
plates) and incubate at RT for 20 minutes. Be sure to distribute the solution over each 
well (note: a 12-channel pipetter may be used with tips on every other channel). 
Aspirate off the Poly-D-Lysine solution and rinse with 1ml PBS (Phosphate Buffered 
Saline). The PBS should remain in the well until just prior to plating the cells and 
plates may be poly-lysine coated in advance for up to two weeks. 

Plate 293T cells (do not carry cells past P+20) at 2 x 10 5 cells/well in .5ml 
DMEM(Dulbecco's Modified Eagle Medium)(with 4.5 G/L glucose and L-glutamine 
(12-604F Biowhittaker))/ 10% heat inactivated FBS(14-503F Biowhittaker)/lx 
Penstrep(17-602E Biowhittaker). Let the cells grow overnight. 

The next day, mix together in a sterile solution basin: 300 ul Lipofectamine 
(18324-012 Gibco/BRL) and 5ml Optimem I (31985070 Gibco/BRL)/96-well plate. 
With a small volume multi-channel, pipetter, aliquot approximately 2ug of an 
expression vector containing a polynucleotide insert, produced by the methods 
described in Examples 8 or 9, into an appropriately labeled 96-well round bottom 
plate. With a multi-channel pipetter, add 50ul of the Lipofectamine/Optimem I 
mixture to each well. Pipette up and down gently to mix. Incubate at RT 15-45 
minutes. After about 20 minutes, use a multi-channel pipetter to add 150ul Optimem 
I to each well. As a control, one plate of vector DNA lacking an insert should be 
transfected with each set of transfections. 

Preferably, the transfection shpuld be performed by-tag-teaming the following 
tasks. By tag-teaming, hands on time is cut in half, and the cells do not spend too 
much time on PBS. First, person A aspirates off the media from four 24-well plates 
of cells, and then person B rinses each well with .5-lml PBS. Person A then aspirates 
off PBS rinse, and person B, using al2-channel pipetter with tips on every other 
channel, adds the 2W)ul_ofJDNA/Lipofecta 

first, then to the even wells, to each row on the 24-well plates. Incubate at 37°C for 6 
hours. 
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While cells are incubating, prepare appropriate media, either 1%BSA in 
DMEM with Ix penstrep, or CHO-5 media (1 16.6 mg/L of CaC12 (anhyd); 0.00130 
mg/L CuS0 4 -5H 2 0; 0.050 mg/L of Fe(N0 3 ) 3 -9H 2 0; 0.417 mg/L of FeS0 4 -7H 2 0; 
31 1.80 mg/L of Kcl; 28.64 mg/L of MgCl 2 ; 48.84 mg/L of MgS0 4 ; 6995.50 mg/L of 
5 NaCl; 2400.0 mg/L of NaHC0 3 ; 62.50 mg/L of NaH 2 PO 4 -H 2 0; 71.02 mg/L of 

Na 2 HP04; .4320 mg/L of ZnS0 4 -7H 2 0; .002 mg/L of Arachidonic Acid ; 1.022 mg/L 
of Cholesterol; .070 mg/L of DL-alpha-Tocopherol-Acetate; 0.0520 mg/L of Linoleic 
Acid; 0.010 mg/L of Linolenic Acid; 0.010 mg/L of Myristic Acid; 0.010 mg/L of 
Oleic Acid; 0.010 mg/L of Palmitric Acid; 0.010 mg/L of Palmitic Acid; 100 mg/L of 
10 Pluronic F-68; 0.010 mg/L of Stearic Acid; 2.20 mg/L of Tween 80; 455 1 mg/L of D- 
Glucose; 130.85 mg/ml of L- Alanine; 147.50 mg/ml of L-Arginine-HCL; 7.50 mg/ml 
of L-Asparagine-H 2 0; 6.65 mg/ml of L-Aspartic Acid; 29.56 mg/ml of L-Cystine- 
2HCL-H 2 0; 31.29 mg/ml of L-Cystine-2HCL; 7.35 mg/ml of L-Glutamic Acid; 365.0 
mg/ml of L-Glutamine; 18.75 mg/ml of Glycine; 52.48 mg/ml of L-Histidine-HCL- 
15 H 2 0; 106.97 mg/mhof L : IsoIeucine; 1 11.45 mg/ml of L-Leucine; 163.75 mg/ml of L- 
Lysine HCL; 32.34 mg/ml of L-Methionine; 68.48 mg/ml of L-Phenylalainine; 40.0 
mg/ml of L-Proline; 26.25 mg/ml of L-Serine; 101.05 mg/ml of L-Threonine; 19.22 
mg/ml of L-Tryptophan; 91 .79 mg/ml of L-Tryrosine-2Na-2H 2 0; 99.65 mg/ml of L- 
Valine; 0.0035 mg/L of Biotin; 3.24 mg/L of D-Ca Pantothenate; 1 1 .78 mg/L of 
20 Choline Chloride; 4.65 mg/L of Folic Acid; 15.60 mg/L of i-Inositol; 3.02 mg/L of 
Niacinamide; 3.00 mg/L of Pyridoxal HCL; 0.031 mg/L of Pyridoxine HCL; 0.319 
mg/L of Riboflavin; 3.17 mg/L of Thiamine HCL; 0,365 mg/L of Thymidine; and 

0.680 mg/L of Vitamin B I2 ; 25 mM of HEPES Buffer; 2.39 mg/L of Na 

Hypoxanthine; 0. 105 mg/L of Lipoic Acid; 0.08 1 mg/L of Sodium Putrescine-2HCL; 
25 55.0 mg/L of Sodium Pyruvate; 0.0067 mg/L of Sodium Selenite; 20uM of 

Ethanolamine; 0.122 mg/L of Ferric Citrate; 41.70 mg/L of Methyl-B-Cyclodextrin 
complexed with Linoleic Acid; 33.33 mg/L of Methyl-B-Cyclodextrin complexed 
with Oleic Aci d;^cLLO,mg/L,of-Methvl-B-Gyclodextrin-complexed with"Retinan 
with 2mm glutamine and lx penstrep. (BSA (81-068-3 Bayer) lOOgm dissolved in 1L 
30 DMEM for a 10% BSA stock solution). Filter the media and collect 50 ul for 
endotoxin assay in 15ml polystyrene conical. 
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The transfection reaction is terminated, preferably by tag-teaming, at the end 
of the incubation period. Person A aspirates off the transfection media, while person 
B adds 1.5ml appropriate media to each well. Incubate at 37°C for 45 or 72 hours 
depending on the media used: 1%BSA for 45 hours or CHO-5 for 72 hours. 

On day four, using a 300ul multichannel pipetter, aliquot 600ul in one 1ml 
deep well plate and the remaining supernatant into a 2ml deep well. The supernatants 
from each well can then be used in the assays described in Examples 13-20. 

It is specifically understood that when activity is obtained in any of the assays 
described below using a supernatant, the activity originates from either the 
polypeptide directly (e.g., as a secreted protein) or by the polypeptide inducing 
expression of other proteins, which are then secreted into the supernatant. Thus,~the 
invention further provides a method of identifying the protein in the supernatant 
characterized by an activity in a particular assay. 

Example 12: Construction of GAS Reporter Construct 

One signal transduction pathway involved in the differentiation and 
proliferation of cells is called the Jaks-STATs pathway. Activated proteins in the 
Jaks-STATs pathway bind to gamma activation site "GAS" elements or interferon- 
sensitive responsive element ("ISRE"), located in the promoter of many genes. The 
binding of a protein to these elements alter the expression of the associated gene. 

GAS and ISRE elements are recognized by a class of transcription factors 
called Signal Transducers and Activators of Transcription, or "STATs." There are six 
members "of the STATs family. Statl and Stat3 are present in many cell types, as is 
Stat2 (as response to IFN-alpha is widespread). Stat4 is more restricted and is not in 
many cell types though it has been found in T helper class I, cells after treatment with 
IL-12. Stat5 was originally called mammary growth factor, but has been found at 
higher concentrations in other cells including myeloid cells. It can be activated in 
tissue culture cells by many cytokines. 

The STATs are activated to translocate from the cytoplasm to the nucleus 
upon tyrosine phosphorylation by a set of kinases known as the Janus Kinase ("Jaks") 
r family. Jaks represent a distinct family of soluble tyrosine kinases and include Tyk2, 
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Jakl, Jak2, and Jak3. These kinases display significant sequence similarity and are 
generally catalytically inactive in resting cells. 

The Jaks are activated by a wide range of receptors summarized in the Table 
below. (Adapted from review by Schidler and Darnell, Ann. Rev. Biochem. 64:621- 
51 (1995).) A cytokine receptor family, capable of activating Jaks, is divided into two 
groups: (a) Class 1 includes receptors for IL-2, IL-3, EL-4, IL-6, IL-7, IL-9, IL-1 1, IL- 
12, IL-1 5, Epo, PRL, GH, G-CSF, GM-CSF, LIF, CNTF, and thrombopoietin; and (b) 
Class 2 includes IFN-a, IFN-g, and IL-10. The Class 1 receptors share a conserved 
cysteine motif (a set of four conserved cysteines and one tryptophan) and a WSXWS 
motif (a membrane proximal region encoding Trp-Ser-Xxx-Trp-Ser (SEQ ID NO:2)). 

Thus, on binding of a ligand to a receptor, Jaks are activated, which in turn 
activate ST ATs, which then. translocate and bind to GAS elements. This entire 
process is encompassed in the Jaks-STATs signal transduction pathway. 

Therefore, activation of the Jaks-STATs pathway, reflected by the binding of 
the GAS or the ISRE element, can be used to indicate proteins involved in the 
proliferation and differentiation of cells. For example, growth factors and cytokines 
are known to activate the Jaks-STATs pathway. (See Table below.) Thus, by using 
GAS elements linked to reporter molecules, activators of the Jaks-STATs pathway 
can be identified. 
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To construct a synthetic GAS containing promoter element, which is used in 
the Biological Assays described in Examples 13-14, a PCR based strategy is 
employed to generate a GAS-SV40 promoter sequence. The 5' primer contains four 
tandem copies of the GAS binding site found in the IRF1 promoter and previously 
demonstrated to bind STATs upon induction with a range of cytokines (Rothman et 
al., Immunity 1:457-468 (1994).), although other GAS or ISRE elements can be used 
instead. The 5' primer also contains 18bp of sequence complementary to the SV40 
early promoter sequence and is flanked with an Xhol site. The sequence of the 5* 
primer is: 

5 ' :GCGCCTCG AG ATTTCCCCGAA ATCTAG ATTTCCCCGA AATGATTTCCCC 
GAAATG ATTTCCCCG A AATATCTGCCATCTCAATT AG: 3 ' (SEQ ID NO:3) 

The downstream primer is complementary to the SV40 promoter and is 
flanked with a Hind III site: 5 ' : GCGGC A AGCTTTTTGC A A AGCCTAGGC: 3 * 
(SEQ ID NO:4) 

PCR amplification is performed using the S V40 promoter template present in 
the B-gal .-promoter plasmid obtained from Clontech. The resulting PCR fragment is 
digested with Xhol/Hind III and subcloned into BLSK2-. (Stratagene.) Sequencing 
with forward and reverse primers confirms that the insert contains the following 
sequence: 

5 ' : CTCGAG ATTTCCCCG A A ATCTAG ATTTCCCCG AAATG ATTTCCCCG AAA 
TGATTTCCCCGAAATATCTGCCATCTCAATTAGTCAGCAACCATAGTCCCG 
CCCCTAACTCCGCCCATCCCGCCCCTAACTCCGCCCAGTTCCGCCCATTCT 
CCGCCCCATGGCTGACTAA1T1T1T 

TCGGCCTCTGAGCTATTCCAGAAGTAGTGAGGAGGCTTTTTTGGAGGCCT 
AGGCTTTTGCAAAAAGCTT:3* (SEQIDNO:5) 

With this GAS promoter element linked to the SV40 promoter, a GAS:SEAP2 
reporter construct is next engineered. Here, the reporter molecule is a secreted 

alkaline^phosphat ase, or "SEAP " C learlv.-however.-any-reporter mol&culfrcan he 

instead of SEAP, in this or in any of the other Examples. Well known reporter 
molecules that can be used instead of SEAP include chloramphenicol 
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acetyltransferase (CAT), luciferase, alkaline phosphatase, B-galactosidase, green 
fluorescent protein (GFP), or any protein detectable by an antibody. 

The above sequence confirmed synthetic GAS-SV40 promoter element is 
subcloned into the pSEAP-Promoter vector obtained from Clontech using Hindin and 
Xhol, effectively replacing the SV40 promoter with the amplified GAS:SV40 
promoter element, to create the GAS-SEAP vector. However, this vector does not 
contain a neomycin resistance gene, and therefore, is not preferred for mammalian 
expression systems. 

Thus, in prder to generate mammalian stable cell lines expressing the GAS- 
SEAP reporter, the GAS-SEAP cassette is removed from the GAS-SEAP vector using 
Sail and NotI, and inserted into a backbone vector containing the neomycin resistance 
gene, such as pGFP-1 (Clontech), using these restriction sites in the multiple cloning 
site, to create the GAS-SEAP/Neo vector. Once this vector is transfected into 
mammalian cells, this vector can then be used as a reporter molecule for GAS binding 
as described in Examples 13-14.; ■ . -"" " 

Other constructs can be made using the above description and replacing GAS 
with a different promoter sequence. For example, construction of reporter molecules 
containing NFK-B and EGR promoter sequences are described in Examples 15 and 
16. However, many other promoters can be substituted using the protocols described 
in these Examples. For instance, SRE, EL-2, NFAT, or Osteocalcin promoters can be 
substituted, alone or in combination (e.g., GAS/NF-KB/EGR, GAS/NF-KB, II- 
2/NFAT, or NF-KB/GAS). Similarly, other cell lines can be used to test reporter 
construct activity, such as HELA (epithelial), HUVEC (endothelial); Reh (B-cell), " 
Saos-2~(osteoblast), HUVAC (aortic), or Cardiomyocyte. 

Example 13: High-Th roughput Screening Assay for T-cell Activity. 

The following protocol is used to assess T-cell activity by identifying factors, 
such as growth factors and cytokine^^ 

cell activity is assessed using the GAS/SEAP/Neo construct produced in Example 12. 
Thus, factors that increase SEAP activity indicate the ability to activate the Jaks- 
STATS signal transduction pathway. The T-cell used in this assay is Jurkat T-cells 
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(ATCC Accession No. TIB- 152), although Molt-3 cells (ATCC Accession No. CRL- 
1552) and Molt-4 cells (ATCC Accession No. CRL-1582) cells can also be used. 

Jurkat T-cells are lymphoblastic CD4+ Thl helper cells. In order to generate 
stable cell lines, approximately 2 million Jurkat cells are transfected with the GAS- 
5 SEAP/neo vector using DMRIE-C (Life Technologies)(transfection procedure 
described below). The transfected cells are seeded to a density of approximately 
20,000 cells per well and transfectants resistant to 1 mg/ml genticin selected. 
Resistant colonies are expanded and then tested for their response to increasing 
concentrations of interferon gamma. The dose response of a selected clone is 

10 demonstrated. 

Specifically, the following protocol will yield sufficient cells for 75 wells 
containing 200 ul of cells. Thus, it is either scaled up, or performed in multiple to 
generate sufficient cells for multiple 96 well plates. Jurkat cells are maintained in 
RPMI + 10% serum with l%Pen-Strep. Combine 2.5 mis of OPTI-MEM (Life 

15 Technologies) withTO ug of plasmid DNA in a T25 flask. Add 2.5 ml OPTI-MEM 
containing 50 ul of DMRIE-C and incubate at room temperature for 15-45 mins. 

During the incubation period, count cell concentration, spin down the required 
number of cells (10 7 per transfection), and resuspend in OPTI-MEM to a final 
concentration of 10 7 cells/ml. Then add 1ml of 1 x 10 7 cells in OPTI-MEM to T25 

20 flask and incubate at 37°C for 6 hrs. After the incubation, add 10 ml of RPMI + 15% 
serum. 

The Jurkat:GAS-SEAP stable reporter lines are maintained in RPMI + 10% 

. serum, 1 mg/ml Genticin, and 1% Pen-Strep. These cells are treated with 

supernatants containing a polypeptide as produced by the protocol described in 
25 Example 11. 

. On the day of treatment with the supernatant, the cells should be washed and 
resuspended in fresh RPMI + 10% serum to a density of 500,000 cells per ml. The 
exact number of cells required wiUjiependj2n_the-number 

screened. For one 96 well plate, approximately 10 million cells (for 10 plates, 100 
30 million cells) are required. 
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Transfer the cells to a triangular reservoir boat, in order to dispense the cells 
into a 96 well dish, using a 12 channel pipette. Using a 12 channel pipette, transfer 
200 ul of cells into each well (therefore adding 100, 000 cells per well). 

After all the plates have been seeded, 50 ul of the supernatants are transferred 
5 directly from the 96 well plate containing the supernatants into each well using a 12 
channel pipette. In addition, a dose of exogenous interferon gamma (0.1, 1.0, 10 ng) 
is added to wells H9, H10, and HI 1 to serve as additional positive controls for the 
assay. 

The 96 well dishes containing Jurkat cells treated with supernatants are placed 
10 in an incubator for 48 hrs (note: this time is variable between 48-72 hrs). 35 ul 
samples from each well are then transferred to an opaque 96 well plate using a 12 
channel pipette. The opaque plates should be covered (using sellophene covers) and 
stored at -20°C until SEAP assays are performed according to Example 17. The 
plates containing the remaining, treated cells are placed at 4°C and serve as a source 
15 of material for repeating the assay on a specific well if desired. 

As a positive control, 100 Unit/ml interferon gamma can be used which is 
known to activate Jurkat T cells. Over 30 fold induction is typically observed in the 
positive control wells. 

The above protocol may be used in the generation of both transient, as well as, 
20 stable transfected cells, which would be apparent to those of skill in the art. 

Example 14: High-Throughput Screening Assay Identifying Myeloid Activity 

The following protocol is used to assess myeloid activity by identifying 

factors, such as growth factors and cytokines, that may proliferate or differentiate 

25 myeloid cells. Myeloid cell activity is assessed using the GAS/SEAP/Neo construct 
produced in Example 12. Thus, factors that increase SEAP activity indicate the 
ability to activate the Jaks-STATS signal transduction pathway. The myeloid cell 

used4n-this-assay-is-U9377a"pre-monocyte celOine, although TF-1, HL60, or KG1 

can be used. 

30 To transiently transfect U937 cells with the GAS/SEAP/Neo construct 

produced in Example 12, a DEAE-Dextran method (Kharbanda et. al., 1994, Cell 
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Growth & Differentiation, 5:259-265) is used. First, harvest 2x1 Oe 7 U937 cells and 
wash with PBS. The U937 cells are usually grown in RPMI 1640 medium containing 
10% heat-inactivated fetal bovine serum (FBS) supplemented with 100 units/ml 
penicillin and 100 mg/ml streptomycin. 
5 Next, suspend the cells in 1 ml of 20 mM Tris-HCl (pH 7.4) buffer containing 

0.5 mg/ml DEAE-Dextran78 ug GAS-SEAP2 plasmid DNA, 140 mM NaC175 mM 
KC1, 375 uM Na 2 HP0 4 .7H20, 1 mM MgCl 2 , and 675 uM CaCl 2 . Incubate at 37°C 
for 45 min. 

Wash the cells with RPMI 1640 medium containing 10% FBS and then 
10 resuspend in 10 ml complete medium and incubate at 37°C for 36 hr. 

The GAS-SEAP/U937 stable cells are obtained by growing the cells in 400 
ug/ml G418. The G418-free medium is used for routine growth but every one to two 
months, the cells should be re-grown in 400 ug/ml G418 for couple of passages. 

g 

These cells are tested by harvesting .1x10 cells (this, is enough for ten 96- well 
15 plates assay) and wash with PBS. Suspend the cells in 200 ml above described 

growth medium, with a final density of 5xl0 5 cells/ml. Plate 200 ul cells per well in 
the 96-weIl plate (or 1x10 s cells/well). 

Add 50 ul of the supernatant prepared by the protocol described in Example 
11. Incubate at 37°C for 48 to 72 hr. As a positive control, 100 Unit/ml interferon 
20 gamma can be used which is known to activate U937 cells. Over 30 fold induction is 
typically observed in the positive control wells. SEAP assay the supernatant 
according to the protocol described in Example 17. _ - 

Example 15: High-Throughput Screening Assay Identifying Neuronal Activity. 

25 When cells undergo differentiation and proliferation, a group of genes are 

activated through many different signal transduction pathways. One of these genes, 

EGR1 (early growth response gene I), is induced in various_tissues-and cell-types 

upon activation. The promoter of EGR1 is responsible for such induction. Using the 
EGR1 promoter linked to reporter molecules, activation of cells can be assessed. 
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Particularly, the following protocol is used to assess neuronal activity in PC 12 
cell lines. PC 12 cells (rat phenochromocytoma cells) are known to proliferate and/or 
differentiate by activation with a number of mitogens, such as TPA (tetradecanoyl 
phorbol acetate), NGF (nerve growth factor), and EGF (epidermal growth factor). 
5 The EGR1 gene expression is activated during this treatment. Thus, by stably 

transfecting PC 12 cells with a construct containing an EGR promoter linked to SEAP 
reporter, activation of PC 12 cells can be assessed. 

The EGR/SEAP reporter construct can be assembled by the following 
protocol. The EGR-1 promoter sequence (-633 to +l)(Sakamoto K et al., Oncogene 
10 6:867-871 (1991)) can be PCR amplified from human genomic DNA using the 
following primers: 

5' GCGCTCGAGGGATGACAGCGATAGAACCCCGG -3' (SEQ ID NO:6) 
5' GCGAAGCTTCGCGACTCCCCGGATCCGCCTC-3' (SEQ ID NO:7) 
Using the GAS:SEAP/Neo vector produced in Example 12, EGR1 amplified 
15 product can then be4nseited into this vector. Linearize the GAS: SEAP/Neo vector 
using restriction enzymes Xhol/Hindlll, removing the GAS/SV40 stuffer. Restrict the 
EGR1 amplified product with these same enzymes. Ligate the vector and the EGR1 
promoter. 

To prepare 96 well-plates for cell culture, two mis of a coating solution (1:30 
20 dilution of collagen type I (Upstate Biotech Inc. Cat#08-1 15) in 30% ethanol (filter 
sterilized)) is added per one 10 cm plate or 50 ml per well of the 96- well plate, and 
allowed to air dry for 2 hr. 

PC12 cells are routinely grown in RPMI-1640 medium (Bio^Whittaker) 

containing 10% horse serum (JRH BIOSCIENCES, Cat. # 12449-78P), 5% heat- 
25 inactivated fetal bovine serum (FBS) supplemented with 100 units/ml penicillin and 
100 ug/ml streptomycin on a precoated 10 cm tissue culture dish. One to four split is 
done every three to four days. Cells are removed from the plates by scraping and 

resuspended with pipetting up and down f or more th an J5-times. 

Transfect the EGR/SEAP/Neo construct into PC 12 using the Lipofectamine 
30 protocol described in Example 1 1 . EGR-SEAP/PC12 stable cells are obtained by 
growing the cells in 300 ug/ml G418. The G418-free medium is used for routine 
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growth but every one to two months, the cells should be re-grown in 300 ug/ml G418 
for couple of passages. 

To assay for neuronal activity, a 10 cm plate with cells around 70 to 80% 
confluent is screened by removing the old medium. Wash the cells once with PBS 
5 (Phosphate buffered saline). Then starve the cells in low serum medium (RPMI-1640 
containing-1-% -horse serum and 0:5%-FBS~with antibiotics) overnight. 

The next morning, remove the medium and wash the cells with PBS. Scrape 
off the cells from the plate, suspend the cells well in 2 ml low serum medium. Count 

the cell number and add more low serum medium to reach final cell density as 5x1 0 5 
10 cells/ml. 

Add 200 ul of the cell suspension to each well of 96-well plate (equivalent to 
lxlO 5 cells/well). Add 50 ul supernatant produced by Example 11, 37°C for 48 to 72 
hr. As a positive control, a growth factor known to activate PC 12 cells through EGR 
can be used, such as 50 ng/ul of Neuronal Growth Factor (NGF). Over fifty-fold 
15 induction of SEAP is typically seen in the positive control wells. SEAP assay the 
supernatant according to Example 17. 

Example 16: High-Throughput Screening Assay for T-cell Activity 

NF-kB (Nuclear Factor kB) is a transcription factor activated by a wide 
variety of agents including the inflammatory cytokines IL-1 and TNF, CD30 and 
CD40, lymphotoxin-alpha and lymphotoxin-beta, by exposure to LPS or thrombin, 
and by expression of certain viral gene products. As a transcription factor, NF-kB 
regulates the expression of genes involved in immune cell activation; control of 
apoptosis (NF- kB appears to shield cells from apoptosis), B and T-cell development, 
anti-viral and antimicrobial responses, and multiple stress responses. 

In non-stimulated conditions, NF- kB is retained in the cytoplasm with I-kB 
(Inhibitor kB). However, upon stimulation, I- kB is phosphorylated and degraded, 
cau^i^J^^i^j^shuttle_tO-the nucleusrthereby'a^tiv^fin^ansci^tion of target 
genes. Target genes activated by NF- kB include EL-2, IL-6, GM-CSF, ICAM-1 and 
class 1 MHC. 
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Due to its central role and ability to respond to a range of stimuli, reporter 
constructs utilizing the NF-kB promoter element are used to screen the supernatants 
produced in Example 11. Activators or inhibitors of NF-kB would be useful in 
treating diseases. For example, inhibitors of NF-kB could be used to treat those 
diseases related to the acute or chronic activation of NF-kB, such as rheumatoid 
arthritis. 

To construct a vector containing the NF-kB promoter element, a PCR based 
strategy is employed. The upstream primer contains four tandem copies of the NF-kB 
binding site (GGGGACTTTCCC) (SEQ ID NO:8), 18 bp of sequence complementary 
to the 5' end of the SV40 early promoter sequence, and is flanked with an Xhol site: 
S^GCGGCCTCGAGGGGACTTTCCCGGGGACTTTCCGGGGACTTTCCGGGAC 
TTTCC ATCCTG CC ATCTC A ATT AG : 3 ' (SEQ ID NO:9) 

The downstream primer is complementary to the 3' end of the SV40 promoter 
and is flanked with a Hind III site: 

5 ' : GCGGC AAGCTTTTTGC AAACjCCTAGGC:3 ' (SEQ ID NO:4) 

PCR amplification is performed using the SV40 promoter template present in 
the pB-gal:promoter plasmid obtained from Clontech. The resulting PCR fragment is 
digested with Xhol and Hind III and subcloned into BLSK2-. (Stratagene) 
Sequencing with the T7 and T3 primers confirms the insert contains the following 
sequence: 

s^ctcgaggggactttcccggggactttccggggactttccgggactttcc 
atctgccatctcaattagtcagcaaccatagtcccgcccctaagtccgccc 

ATCCCGCCCCTAACfCCGCCCAGTTCCGCCCATTCTCCGCCCCATGGCTGA 
CTAAriT'lU"Tl"l ATTTATGCAGAGGCCGAGGCCGCCTCGGCCTCTGAGCTA 
TTCCAGAAGTAGTGAGGAGGCTTTTTTGG 
GCTT:3' (SEQ ID NO: 10) 



Next, replace the S V40 minimal promoter element present in the pSEAP2- 
promoter plasmid (Clontech) with this NF-KB/SV40 fragment using Xhol and 
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Hindlll. However, this vector does not contain a neomycin resistance gene, and 
therefore, is not preferred for mammalian expression systems. 

In order to generate stable mammalian cell lines, the NF-KB/SV40/SEAP 
cassette is removed from the above NF-kB/SEAP vector using restriction enzymes 
5 Sail and NotI, and inserted into a vector containing neomycin resistance. Particularly, 
the NF-KB/SV40/SEAP cassette was inserted into pGFP-1 (Clontech), replacing the 
GFP gene, after restricting pGFP-1 with Sail and NotI. 

Once NF-kB/S V40/SEAP/Neo vector is created, stable Jurkat T-cells are 
created and maintained according to the protocol described in Example 13. Similarly, 
10 the method for assaying supematants with these stable Jurkat T-cells is also described 
in Example 13. As a positive control, exogenous TNF alpha (0.1,1, 10 rig) is added to 
wells H9, H10, and HI 1, with a 5-10 fold activation typically observed. 

Example 17: Assay for SEAP Activity 

15 As a reporter molecule for the assays described in Examples 13-16, SEAP 

activity is assayed using the Tropix Phospho-light Kit (Cat. BP-400) according to the 
following general procedure. The Tropix Phospho-light Kit supplies the Dilution, 
Assay, and Reaction Buffers used below. 

Prime a dispenser with the 2.5x Dilution Buffer and dispense 15 |il of 2.5x 

20 dilution buffer into Optiplates containing 35 |xl of a supernatant. Seal the plates with 
a plastic sealer and incubate at 65°C for 30 min. Separate the Optiplates to avoid 
uneven heating. 

Cool the samples to room temperature for 15 minutes. -Empty the dispenser " 
and prime with the Assay Buffer. Add 50 yd Assay Buffer and incubate at room 
25 temperature 5 min. Empty the dispenser and prime with the Reaction Buffer (see the 
table below). Add 50 jxl Reaction Buffer and incubate at room temperature for 20 
minutes. Since the intensity of the chemiluminescent signal is time dependent, and it 
takes about 10 minutes toj^ad^platesjDn luminometerv one-should tfearS^lates at 
" each time and start the second set 10 minutes later. 
30 Read the relative light unit in the luminometer. Set H12 as blank, and print 

the results. An increase in chemiluminescence indicates reporter activity. 
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Reaction Buffer Formulation: 


# of plates 


Kxn Duller uiiuent (ml) 


CbPD (ml) 


10 


60 


3 


1 1 


65 


3.25 


12 


70 


3.5 


13 


75 


3.75 


14 


80 


4 


15 


85 


4.25 


16 


90 


4.5 


17 


95 


4.75 


18 


100 


5 


19 


105 


5.25 


20 


110 


5.5 


21 


115 


5.75 


22 


120 


6 


23 


125 


6.25 


24 


130 


6.5 


25 


135 


6.75 


26 


140 


7 


27 


145 


7.25 


28 


150 


7.5 


29 


155 


7.75 


30 


160 . . 


8 


31 


165. 


8.25 


32 


170 


8.5 


33 


175 


8.75 


34 


180 


9 


35 


185 


9.25 


36 


190 


9.5 


37 


195 


9.75 


38 


200 


10 




90S 




40 


210 


10.5 


41 


215 


10.75 


42 


220 


11 


43 


225 


11.25 


44 


230 


11.5 


45 


235 


11.75 


46 


240 . 


12- 


47 " 


245 


12.25 


48 


250 


12.5 


49 


255 


12.75 


50 


260 


13 



Example 18: High-Throughput Screening Assay Identifying Changes in Small 

Molecule Concentration and Membrane-Permeability 

Binding of a ligand to a receptor is known to alter intracellular levels of small 
molecules, such as calcium, potassium, sodium, and pH, as well as alter membrane 
potential. These alterations can be measured in an assay to identify supernatants 
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which bind to receptors of a particular cell. Although the following protocol 
describes an assay for calcium, this protocol can easily be modified to detect changes 
in potassium, sodium, pH, membrane potential, or any other small molecule which is 
detectable by a fluorescent probe. 

5 The following assay uses Fluorometric Imaging Plate Reader ("FLIPR") to 

measure changes in fluorescent molecules (Molecular Probes) that bind small 
molecules. Clearly, any fluorescent molecule detecting a small molecule can be used 
instead of the calcium fluorescent molecule, fluo-4 (MDlecular Probes, inc.; 
catalog no. F-14202) , used here. 

10 For adherent cells, seed the cells at 10,000 -20,000 cells/well in a Co-star 

black 96-well plate with clear bottom. The plate is incubated in a C0 2 incubator for 
20 hours. The adherent cells are washed two times in Biotek washer with 200 ul of 
HBSS (Hank's Balanced Salt Solution) leaving 100 ul of buffer after the final wash. 
A stock solution of 1 mg/ml fluo-4 is made in 10% pluronic acid DMSO. To 

15 load the cells with~fluo-4 , 50 uLof 12 ug/ml fluo-4 is added to each well. The plate 
is incubated at 37°C in a C0 2 incubator for 60 min. The plate is washed four times in 
the Biotek washer with HBSS leaving 100 ul of buffer. 

For non-adherent cells, the cells are spun down from culture media. Cells are 
re-suspended to 2-5xl0 6 cells/ml with HBSS in a 50-ml conical tube. 4 ul of 1 mg/ml 

20 fluo-4 solution in 10% pluronic acid DMSO is added to each ml of cell suspension. 
The tube is then placed in a 37°C water bath for 30-60 min. The cells are washed 
twice with HBSS, resuspended to lxl 0 6 cells/ml, and dispensed into a microplate, 100 
ul/well. The platens centrifuged at 1000 rpm for 5 min. The plate is then washed^ _ 
once in Denley CellWash with 200 ul, followed by an aspiration step to 100 ul final 

25 volume. 

For a non-cell based assay, each well contains a fluorescent molecule, such as 
fluo-4 . The supernatant is added to the well, and a change in fluorescence is 

detected, 

To measure the fluorescence of intracellular calcium, the FLIPR is set for the 
30 following parameters: (1) System gain is 300-800 mW; (2) Exposure time is 0.4 
second; (3) Camera F/stop is F/2; (4) Excitation is 488 nm; (5) Emission is 530 nm; 
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and (6) Sample addition is 50 ul. Increased emission at 530 nm indicates an 
extracellular signaling event which has resulted in an increase in the intracellular 

Ca* 4 ' concentration. 

5 Example 19: High-Throughput Screening Assay Identifying Tyrosine Kinase 
Activity 

The Protein Tyrosine Kinases (PTK) represent a diverse group of 
transmembrane and cytoplasmic kinases. Within the Receptor Protein Tyrosine 
Kinase RPTK) group are receptors for a range of mitogenic and metabolic growth 

10 factors including the PDGF, FGF, EGF, NGF, HGF and Insulin receptor subfamilies. 
In addition there are a large family of RPTKs for which the corresponding ligand is 
unknown. Ligands for RPTKs include mainly secreted small proteins, but also 
membrane-bound and extracellular matrix proteins. 

Activation of RPTK by ligands involves ligand-mediated receptor 

15 dimerization, resulting in transphosphorylation of the receptor subunits and activation 
of the cytoplasmic tyrosine kinases. The cytoplasmic tyrosine kinases include 
receptor associated tyrosine kinases of the src-family (e.g., src, yes, lck, lyn, fyn) and 
non-receptor linked and cytosolic protein tyrosine kinases, such as the Jak family, 
members of which mediate signal transduction triggered by the cytokine superfamily 

20 of receptors (e.g., the Interleukins, Interferons, GM-CSF, and Leptin). 

Because of the wide range of known factors capable of stimulating tyrosine 
kinase activity, the identification of novel human secreted proteins capable of 
activating tyrosine .kinase, signal transduction pathwaysareof interest. -Therefore, the 
following protocol is designed to identify those novel human secreted proteins 

25 capable of activating the tyrosine kinase signal transduction pathways. 

Seed target cells (e.g., primary keratinocytes) at a density of approximately 
25,000 cells per well in a 96 well Loprodyne Silent Screen Plates purchased from 

Nalge Nunc (Naperville,IL). - The-plates-are-sterilized with-two-30-minute rinses with 

100% ethanol, rinsed with water and dried overnight. Some plates are coated for 2 hr 

30 with 100 ml of cell culture grade type I collagen (50 mg/ml), gelatin (2%) or 

polylysine (50 mg/ml), all of which can be purchased from Sigma Chemicals (St. 
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Louis, MO) or 10% Matrigel purchased from Becton Dickinson (Bedford,MA), or 

calf serum, rinsed with PBS and stored at 4°C. Cell growth on these plates is assayed 
by seeding 5,000 cells/well in growth medium and indirect quantitation of cell 
number through use of alamarBlue as described by the manufacturer Alamar 
5 Biosciences, Inc. (Sacramento, CA) after 48 hr. Falcon plate covers #3071 from 
Becton Dickinson (Bedford,MA) are used to cover the Loprodyne Silent Screen 
Plates. Falcon Microtest m cell culture plates can also be used in some proliferation 
experiments. 

To prepare extracts, A431 cells are seeded onto the nylon membranes of 
10 Loprodyne plates (20,000/200ml/well) and cultured overnight in complete medium. 
Cells are quiesced by incubation in serum-free basal medium for 24 hr. After 5-20 
minutes treatment with EGF (60ng/ml) or 50 ul of the supernatant produced in 
Example 1 1 , the medium was removed and 100 ml of extraction buffer ((20 mM 
HEPES pH 7.5, 0.15 M NaCl, 1% Triton X-100, 0.1% SDS, 2 mM Na3V04, 2 mM 
15 Na4P207 and a cocktail of protease inhibitors (# 1836170) obtained from 

Boeheringer Mannheim (Indianapolis, IN) is added to each well and the plate is 

shaken on a rotating shaker for 5 minutes at 4°C. The plate is then placed in a 
vacuum transfer manifold and the extract filtered through the 0.45 mm membrane 
bottoms of each weli using house vacuum. Extracts are collected in a 96-well 
20 catch/assay plate in the bottom of the vacuum manifold and immediately placed on 
ice. To obtain extracts clarified by centrifugation, the content of each well, after 
detergent solubilization for 5 minutes, is removed and centrifuged for 15 minutes at 

4°C at 16,000 x"g7 " ~ " " ~ 

Test the filtered extracts for levels of tyrosine kinase activity. Although many 
25 methods of detecting tyrosine kinase activity are known, one method is described 
here. 

Generally, the tyrosine kinase activity of a supernatant is evaluated by 
determining its ability to phosphorylate a tyrosine residue on a specific substrate (a 
biotinylated peptide). Biotinylated peptides that can be used for this purpose include 
30 PSK1 (corresponding to amino acids 6-20 of the cell division kinase cdc2-p34) and 
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PSK2 (corresponding to amino acids 1-17 of gastrin). Both peptides are substrates for 
a range of tyrosine kinases and are available from Boehringer Mannheim. 

The tyrosine kinase reaction is set up by adding the following components in 
order. First, add lOul of 5uM Biotinylated Peptide, then lOul ATP/Mg2+ (5mM 
5 ATP/50mM MgCl2), then lOul of 5x Assay Buffer (40mM imidazole hydrochloride, 
pH7.3, 40 mM beta-glycerophosphate, ImM EGTA, lOOmM MgCl2, 5 mM MnC^, 
0.5 mg/ml BSA), then 5ul of Sodium Vanadate(lmM), and then 5ul of water. Mix the 

components gently and preincubate the reaction mix at 30°C for 2 min. Initial the 
reaction by adding lOul of the control enzyme or the filtered supernatant. 
10 The tyrosine kinase assay reaction is then terminated by adding 10 ul of 

120mm EDTA and place the reactions on ice. 

Tyrosine kinase activity is determined by transferring 50 ul aliquot of reaction 

mixture to a microtiter plate (MTP) module and incubating at 37°C for 20 min. This 
allows the streptavadin coated 96 well-plate to associate with the biotinylated peptide. 
15 Wash the MTP module with 300ul/well of PBS four times. Next add 75 ul of anti- 
phospotyrosine antibody conjugated to horse radish peroxidase(anti-P-Tyr- 

POD(0.5u/ml)) to each well and incubate at 37°C for one hour. Wash the well as 
above. 

Next add lOOul of peroxidase substrate solution (Boehringer Mannheim) and 
20 incubate at room temperature for at least 5 mins (up to 30 min). Measure the 

absorbance of the sample at 405 nm by using ELIS A reader. The level of bound 
peroxidase activity: is. quantitated using an.ELIS A reader and reflects the level of_ 

tyrosine kinase activity. 

25 Example 20: High-Throughput Screening Assay Identifying Phosphorylation 
Activity 

Asa-potentiaLaltemativeand/or-complime 

kinase activity described in Example 19, an assay which detects activation 
(phosphorylation) of major intracellular signal transduction intermediates can also be 

30 used. For example, as described below one particular assay can detect tyrosine 
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phosphorylation of the Erk-1 and Erk-2 kinases. However, phosphorylation of other 
molecules, such as Raf, JNK, p38 MAP, Map kinase kinase (MEK), MEK kinase, 
Src, Muscle specific kinase (MuSK), IRAK, Tec, and Janus, as well as any other 
phosphoserine, phosphotyrosine, or phosphothreonine molecule, can be detected by 
5 substituting these molecules for Erk-1 or Erk-2 in the following assay. 

Specificallyi assay plates are made by coating the wells of a 96-well ELISA 
plate with 0.1ml of protein G (lug/ml) for 2 hr at room temp, (RT). The plates are 
then rinsed with PBS and blocked with 3% BS A/PBS for 1 hr at RT. The protein G 
plates are then treated with 2 commercial monoclonal antibodies (lOOng/well) against 
10 Erk-1 

and Erk-2 (1 hr at RT) (Santa Cruz Biotechnology). (To detect other molecules, this 
step can easily be modified by substituting a monoclonal antibody detecting any of 
the above described molecules.) After 3-5 rinses with PBS, the plates are stored at 

4PC until use. - 
15 A43 1 cells are seeded at 20,000/well in a 96-well Loprodyne filterplate and 

cultured overnight in growth medium. The cells are then starved for 48 hr in basal 

medium (DMEM) and then treated with EGF (6ng/well) or 50 ul of the supernatants 

obtained in Example 1 1 for 5-20 minutes. The cells are then solubilized and extracts 

filtered directly into the assay plate. 
20 After incubation with the extract for 1 hr at RT, the wells are again rinsed. As 

a positive control, a commercial preparation of MAP kinase (lOng/well) is used in 

place 

of A431 extract. Plates are then treated with a commercial polyclonal (rabbit) 
antibody (lug/ml) which specifically recognizes the phosphorylated epitope of the 
25 Erk-1 and Erk-2 kinases (1 hr at RT). This antibody is biotinylated by standard 
procedures. The bound polyclonal antibody is then quantitated by successive 
incubations with Europium-streptayidin and Europium fluorescence enhancing 
reagent in the Wallac DELFIA instrument (time-resolved fluorescence). An increased 
fluorescent signal over background indicates a phosphorylation. 

30 
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Example 21: Method of Determining Alterations in a Gene Corresponding to a 
Polynucleotide 

RNA isolated from entire families or individual patients presenting with a 
phenotype of interest (such as a disease) is be isolated. cDNA is then generated from 
5 these RNA samples using protocols known in the art. (See, Sambrook.) The cDNA 
is then used as a template for PCR, employing primers surrounding regions of interest 
in SEQ ID NO:X. Suggested PCR conditions consist of 35 cycles at 95°C for 30 
seconds; 60-120 seconds at 52-58°C; and 60-120 seconds at 70°C, using buffer 
solutions described in Sidransky, D., et al., Science 252:706 (1991). 

10 PCR products are then sequenced using primers labeled at their 5* end with T4 

polynucleotide kinase, employing SequiTherm Polymerase. (Epicentre 
Technologies). The intron-exon borders of selected exons is also determined and 
genomic PCR products analyzed to confirm the results. PCR products harboring 
suspected mutations is then cloned and sequenced to validate the results of the direct 

15 sequencing. 

PCR products is cloned into T-tailed vectors as described in Holton, T.A. and 
Graham, M.W., Nucleic Acids Research, 19:1 156 (1991) and sequenced with T7 
polymerase (United States Biochemical). Affected individuals are identified by 
mutations not present in unaffected individuals. 

20 Genomic rearrangements are also observed as a method of determining 

alterations in a gene corresponding to a polynucleotide. Genomic clones isolated 
according to Example 2 are nick-translated with digoxigenindeoxy-uridine 5'- 
triphosphate (Boehringer Manheim), and FISHperformed as described in Johnson, 
Cg. et al., Methods Cell Biol. 35:73-99 (1991). Hybridization with the labeled probe 

25 is carried out using a vast excess of human cot-1 DNA for specific hybridization to 
the corresponding genomic locus. 

Chromosomes are counterstained with 4,6-diamino-2-phenylidole and 
propidimn iodiderp^ucihg a combinatiorTof C- and"R-b^ds7TAlipied"iiSages"for 
precise mapping are obtained using a triple-band filter set (Chroma Technology, 

30 Brattleboro, VT) in combination with a cooled charge-coupled device camera 

(Photometries, Tucson, AZ) and variable excitation wavelength filters. (Johnson, Cv. 
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et al., Genet. Anal. Tech. Appl., 8:75 (1991).) Image collection, analysis and 
chromosomal fractional length measurements are performed using the ISee Graphical 
Program System. (Inovision Corporation, Durham, NC.) Chromosome alterations of 
the genomic region hybridized by the probe are identified as insertions, deletions, and 
translocations. These alterations are used as a diagnostic marker for an associated 
disease. 

Example 22: Method of Detecting Abnormal Levels of a Polypeptide in a 
Biological Sample 

A polypeptide of the present invention can be detected in a biological sample, 
and if an increased or decreased level of the polypeptide is detected, this polypeptide 
is a marker for a particular phenotype. Methods of detection are numerous, and thus, 
it is understood that one skilled in the art can modify the following assay to fit their 
particular needs. 

For example, antibody-sandwich ELISAs are used to detect polypeptides in a 
sample, preferably a biological sample. Wells of a microtiter plate are coated with 
specific antibodies, at a final concentration of 0.2 to 10 ug/ml. The antibodies are 
either monoclonal or polyclonal and are produced by the method described in 
Example 10. The wells are blocked so that non-specific binding of the polypeptide to 
the well is reduced. 

The coated wells are then incubated for > 2 hours at RT with a sample 
containing the polypeptide. Preferably, serial dilutions of the sample should be used 
to-validate results. The plates-are then washed three times with deionized or distilled 
water to remove unbounded polypeptide. 

Next, 50 ul of specific antibody-alkaline phosphatase conjugate, at a 
concentration of 25-400 ng, is added and incubated for 2 hours at room temperature. 
The plates are again washed three times with deionized or distilled water to remove 
~ unboundedconjugate; " ~~ 

Add 75 ul of 4-methylumbelliferyl phosphate (MUP) or p-nitrophenyl 
phosphate (NPP) substrate solution to each well and incubate 1 hour at room 
temperature. Measure the reaction by a microtiter plate reader. Prepare a standard 
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curve, using serial dilutions of a control sample, and plot polypeptide concentration 
on the X-axis (log scale) and fluorescence or absorbance of the Y-axis (linear scale). 
Interpolate the concentration of the polypeptide in the sample using the standard 
curve. 

5 

Example 23: Formulating a Polypeptide 

The secreted polypeptide composition will be formulated and dosed in a 
fashion consistent with good medical practice, taking into account the clinical 
condition of the individual patient (especially the side effects of treatment with the 

10 secreted polypeptide alone), the site of delivery, the method of administration, the 

scheduling of administration, and other factors known to practitioners. The "effective 
amount" for purposes herein is thus determined by such considerations. 

As a general proposition, the total pharmaceutical^ effective amount of 
secreted polypeptide administered parenterally per dose will be in the range of about 1 

15 |ig/kg/day to 10 mg/kg/day of patient body weight, although, as noted above, this will 
be subject to therapeutic discretion. More preferably, this dose is at least 0.01 
mg/kg/day, and most preferably for humans between about 0.01 and 1 mg/kg/day for 
the hormone. If given continuously, the secreted polypeptide is typically 
administered at a dose rate of about 1 jig/kg/hour to about ,50 n.g/kg/hour, either by 1- 

20 4 injections per day or by continuous subcutaneous infusions, for example, using a 
mini-pump. An intravenous bag solution may also be employed. The length of 
treatment needed to observe changes and the interval following treatment for 
— - responses to occur appears to vary depending on the desired effect." 

Pharmaceutical compositions containing the secreted protein of the invention 

25 are administered orally, rectally, parenterally, intracistemally, intravaginally, 

intraperitoneally, topically (as by powders, ointments, gels, drops or transdermal 
patch), bucally, or as an oral or nasal spray. "Pharmaceutically acceptable carrier" 

refers to a non^toxic solidrsemisolid or liquid fille^ 

formulation auxiliary of any type. The term "parenteral" as used herein refers to 

30 modes of administration which include intravenous, intramuscular, intraperitoneal, 
intrasternal, subcutaneous and intraarticular injection and infusion. 
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The secreted polypeptide is also suitably administered by sustained-release 
systems. Suitable examples of sustained-release compositions include semi- 
permeable polymer matrices in the form of shaped articles, e.g., films, or 
mirocapsules. Sustained-release matrices include polylactides (U.S. Pat. No. 
5 3,773,919, EP 58,481), copolymers of L-glutamic acid and gamma-ethyl-L-glutamate 
(Sidman, U. et al., Biopolymers 22:547-556 (1983)), poly (2- hydroxyethyl 
methacrylate) (R. Langer et al., J. Biomed. Mater. Res. 15:167-277 (1981), and R. 
Langer, Chem. Tech. 12:98-105 (1982)), ethylene vinyl acetate (R. Langer et al.) or 
poly-D- (-)-3-hydroxybutyric acid (EP 133,988). Sustained-release compositions 

10 also include liposomally entrapped polypeptides. Liposomes containing the secreted, 
polypeptide are prepared by methods known per se: DE 3,218,121; Epstein et al., 
Proc. Natl. Acad. Sci. USA 82:3688-3692 (1985); Hwang et al., Proc. Natl. Acad. Sci. 
USA 77:4030-4034 (1980); EP 52,322; EP 36,676; EP 88,046; EP 143,949; EP 
142,641; Japanese Pat. Appl. 83-1 18008; U.S. Pat. Nps. 4,485,045 and 4,544,545; and 

15 EP 102,324. Ordinarily, the liposomes" are of the small (about 200-800 Angstroms) 
unilamellar type in which the lipid content is greater than about 30 mol. percent 
cholesterol, the selected proportion being adjusted for the optimal secreted 
polypeptide therapy. 

For parenteral administration, in one embodiment, the secreted polypeptide is 

20 formulated generally by mixing it at the desired degree of purity, in a unit dosage 
injectable form (solution, suspension, or emulsion), with a pharmaceutically 
acceptable carrier, i.e., one that is non-toxic to recipients at the dosages and 
- concentrations employed and is compatible with other ingredients of the formulation. 
For example, the formulation preferably does not include oxidizing agents and other 

25 compounds that are known to be deleterious to polypeptides. 

Generally, the formulations are prepared by contacting the polypeptide 
uniformly and intimately with liquid carriers or finely divided solid carriers or both. 

Thenrif necessaryrthe product is shaped into the desired foflimlatidnr Preferably the - 

carrier is a parenteral carrier, more preferably a solution that is isotonic with the blood 

30 of the recipient. Examples of such carrier vehicles include water, saline, Ringer's 
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solution, and dextrose solution. Non-aqueous vehicles such as fixed oils and ethyl 
oleate are also useful herein, as well as liposomes. . 

The carrier suitably contains minor amounts of additives such as substances 
that enhance isotonicity and chemical stability. Such materials are non-toxic to 
5 recipients at the dosages and concentrations employed, and include buffers such as 
phosphate, citrate, succinate, acetic acid, and other organic acids or their salts; 
antioxidants such as ascorbic acid; low molecular weight (less than about ten 
residues) polypeptides, e.g., polyarginine or tripeptides; proteins, such as serum 
albumin, gelatin, or immunoglobulins; hydrophilic polymers such as 
10 polyvinylpyrrolidone; amino acids, such as glycine, glutamic acid, aspartic acid, or 

arginine; monosaccharides, disaccharides, and other carbohydrates including cellulose 
or its derivatives, glucose, manose, or dextrins; chelating agents such as EDTA; sugar 
alcohols such as mannitol or sorbitol; counterions such as sodium; and/or nonionic 
surfactants such as polysorbates, poloxamers, or PEG. 

15 The secreted"polypeptide is typically formulated in such vehicles at a 

concentration of about 0. 1 mg/ml to 100 mg/ml, preferably 1-10 mg/ml, at a pH of 
about 3 to 8. It will be understood that the use of certain of the foregoing excipients, 
carriers, or stabilizers will result in the formation of polypeptide salts. 

Any polypeptide to be used for therapeutic administration can be sterile. 

20 Sterility is readily accomplished by filtration through sterile filtration membranes 
(e.g., 0.2 micron membranes). Therapeutic polypeptide compositions generally are 
placed into a container having a sterile access port, for example, an intravenous 

solution-bag or vial having a stopper pierceable by a hypodermic injection needle. 

Polypeptides ordinarily will be stored in unit or multi-dose containers, for 

25 example, sealed ampoules or vials, as an aqueous solution or as a lyophilized 

formulation for reconstitution. As an example of a lyophilized formulation, 10-ml 
vials are filled with 5 ml of sterile-filtered 1 % (w/v) aqueous polypeptide solution, 

andthe resulting-mixtureislyophilized— Theinfusion"solution"is'prepmed"by 

reconstituting the lyophilized polypeptide using bacteriostatic Water-for-Injection. 

30 The invention also provides a pharmaceutical pack or kit comprising one or 

more containers filled with one or more of the ingredients of the pharmaceutical 
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compositions of the invention. Associated with such container(s) can be a notice in 
the form prescribed by a governmental agency regulating the manufacture, use or sale 
of pharmaceuticals or biological products, which notice reflects approval by the 
agency of manufacture, use or sale for human administration. In addition, the 
5 polypeptides of the present invention may be employed in conjunction with other 
therapeutic compounds. 

Example 24: Method of Treating Decreased Levels of the Polypeptide 

It will be appreciated that conditions caused by a decrease in the standard or 
10 normal expression level of a secreted protein in an individual can be treated by 
administering the polypeptide of the present invention, preferably in the secreted 
form. Thus, the invention also provides a method of treatment of an individual in 
need of an increased level of the polypeptide comprising administering to such an 
individual a pharmaceutical composition comprising an amount of the polypeptide to 
15 increase the activity level of the: polypeptide in such an individual. 

For example, a patient with decreased levels of a polypeptide receives a daily 
dose 0.1-100 ug/kg of the polypeptide for six consecutive days. Preferably, the 
polypeptide is in the secreted form. The exact details of the dosing scheme, based on 
administration and formulation, are provided in Example 23. 

20 

Example 25: Method of Treating Increased Levels of the Polypeptide 

Antisense technology is used to inhibit production of a polypeptide of the 
present invention. This technology is one example of a method of decreasing levels 
of a polypeptide, preferably a secreted form, due to a variety of etiologies, such as 
25 cancer. 

For example, a patient diagnosed with abnormally increased levels of a 
polypeptide is administered intravenously antisense polynucleotides at 0.5, 1.0, 1.5, 

2:0-and-3:0-mg/kg day for~21daysr-' This treatment isTepeated after a" 7-day rest 

period if the treatment was well tolerated. The formulation of the antisense 

30 polynucleotide is provided in Example 23. 
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Example 26: Method of Treatment Using Gene Therapy 

One method of gene therapy transplants fibroblasts, which are capable of 
expressing a polypeptide, onto a patient. Generally, fibroblasts are obtained from a 
subject by skin biopsy. The resulting tissue is placed in tissue-culture medium and . 
separated into small pieces. Small chunks of the tissue are placed on a wet surface of 
a tissue culture flask, approximately ten pieces are placed in each flask. The flask is 
turned upside down, closed tight and left at room temperature over night. After 24 
hours at room temperature, the flask is inverted and the chunks of tissue remain fixed 
to the bottom of the flask and fresh media (e.g., Ham's F12 media, with 10% FBS, 
penicillin and streptomycin) is added. The flasks are then incubated at 37°C for 
approximately one week. 

At this time, fresh media is added and subsequently changed every several 
days. After an additional two weeks in culture, a monolayer of fibroblasts emerge. 
The monolayer is trypsinized and scaled into larger flasks. 

pMV-7 (Kifschmeier, P:T. et al., DNA, 7:219-25 (1988)), flanked by the long 
terminal repeats of the Moloney murine sarcoma virus, is digested with EcoRI and 
Hindlll and subsequently treated with calf intestinal phosphatase. The linear vector is 
fractionated on agarose gel and purified, using glass beads. 

The cDNA encoding a polypeptide of the present invention can be amplified 
using PGR primers which correspond to the 5' and 3' end sequences respectively as set 
forth in Example 1 . Preferably, the 5' primer contains an EcoRI site and the 3' primer 
includes a HindlQ site. Equal quantities of the Moloney murine sarcoma virus linear 
backbone and the.amplified EcoRI and Hindlll fragment are added together, in the 
presence of T4 DNA ligase. The resulting mixture is maintained under conditions 
appropriate for ligation of the two fragments. The ligation mixture is then used to 
transform bacteria HB101, which are then plated onto agar containing kanamycin for 
the purpose of confirming that the vector has the gene of interest properly inserted. 

The-amphotropic-pA317~or-GP+aml-2-packaging cells-are-grown in tissue 

culture to confluent density in Dulbecco's Modified Eagles Medium (DMEM) with 
10% calf serum (CS), penicillin and streptomycin. The MSV vector containing the 
gene is then added to the media and the packaging cells transduced with the vector. 



WO 99/47540 



PCT7US99/05804 



The packaging cells now produce infectious viral particles containing the gene (the 
packaging cells are now referred to as producer cells). 

Fresh media is added to the transduced producer cells, and subsequently, the 
media is harvested from a 10 cm plate of confluent producer cells. The spent media, 
5 containing the infectious viral particles, is filtered through a millipore filter to remove 
detached producer cells and this media is then used to infect fibroblast cells. Media is 
removed from a sub-confluent plate of fibroblasts and quickly replaced with the 
media from the producer cells. This media is removed and replaced with fresh media. 
If the titer of virus is high, then virtually all fibroblasts will be infected and no 
10 selection is required. If the titer is very low, then it is necessary to use a retroviral 

vector that has a selectable marker, such as neo or his. Once the fibroblasts have been 
efficiently infected, the fibroblasts are analyzed to determine whether protein is 
produced. 

The engineered fibroblasts are then transplanted onto the host, either alone or 
15 after having been grown to confluence On cytodex 3 microcafrier beads. 



Example 27: Method of Treatment Using Gene Therapy - In Vivo 

20 Another aspect of the present invention is using in vivo gene therapy methods 

to treat disorders, diseases and conditions. The gene therapy method relates to the 
introduction of naked nucleic acid (DNA, RNA, and antisense DNA or RNA) 
sequences into an animal to incr ease or decrease the expression of the polypeptide. 
The polynucleotide of the present invention may be operatively linked to a promoter 

25 or any other genetic elements necessary for the expression of the polypeptide by the 
target tissue. Such gene therapy and delivery techniques and methods are known in 
the art, see, for example, WO90/11092, W098/11779; U.S. Patent NO. 5693622, 

5705151, 5 580859; Tabata H. etal . ( 1997) C ardiovasc. Res. 35(3 ):470- 479, Chao J et 

al. (1997) Pharmacol. Res. 35(6):5 17-522, Wolff J.A. (1997) Neuromuscul. Disord. 

30 7(5):314-318, Schwartz B. et al. (1996) Gene Ther. 3(5):405-41 1 , Tsurumi Y. et al. 
(1996) Circulation 94(12):328 1-3290 (incorporated herein by reference). 
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The polynucleotide constructs may be delivered by any method that delivers 
injectable materials to the cells of an animal, such as, injection into the interstitial 
space of tissues (heart, muscle, skin, lung, liver, intestine and the like). The 
polynucleotide constructs can be delivered in a pharmaceutical^ acceptable liquid or 
aqueous carrier. 

The term "naked" polynucleotide, DNA or RNA, refers to sequences that are 
free from any delivery vehicle that acts to assist, promote, or facilitate entry into the 
cell, including viral sequences, viral particles, liposome formulations, lipofectin or 
precipitating agents and the like. However, the polynucleotides of the present 
invention may also be delivered in liposome formulations (such as those taught in 
Feigner P.L. et al. (1995) Ann. NY Acad. Sci. 772:126-139 and Abdallah B. et al. 
(1995) Biol. Cell 85(1): 1-7) which can be prepared by methods well known to those 
skilled in the art. 

The polynucleotide vector constructs used in the gene therapy method are 
preferably constructs that. will not' integrate into the host genome nor will they contain 
sequences that allow for replication. Any strong promoter known to those skilled in 
the art can be used for driving the expression of DNA. Unlike other gene therapies 
techniques, one major advantage of introducing naked nucleic acid sequences into 
target cells is the transitory nature of the polynucleotide synthesis in the cells. Studies 
have shown that non-replicating DNA sequences can be introduced into cells to 
provide production of the desired polypeptide for periods of up to six months. 

The polynucleotide construct can be delivered to the interstitial space of 
tissues within the an animal, including of muscle, skin, brain, lung, liver, spleen, bone 
marrow, thymus, heart, lymph, blood, bone, cartilage, pancreas, kidney, gallbladder, 
stomach, intestine, testis, ovary, uterus, rectum, nervous system, eye, gland, and 
connective tissue. Interstitial space of the tissues comprises the intercellular fluid, 
mucopolysaccharide matrix among the reticular fibers of organ tissues, elastic fibers 
in the walls of vessels or chambers, collagen fibers of fibrous tissues, or that same 
"matrix within connectivFtissue ensheatfiing muscle cells or in the lacunae oTboneTTt 
is similarly the space occupied by the plasma of the circulation and the lymph fluid of 
the lymphatic channels. Delivery to the interstitial space of muscle tissue is preferred 
for the reasons discussed below. They may be conveniently delivered by injection 
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into the tissues comprising these cells. They are preferably delivered to and 
expressed in persistent, non-dividing cells which are differentiated, although delivery 
and expression may be achieved in non-differentiated or less completely 
differentiated cells, such as, for example, stem cells of blood or skin fibroblasts. In 
5 vivo muscle cells are particularly competent in their ability to take up and express 
polynucleotides. 

For the naked polynucleotide injection, an effective dosage amount of DNA or 
RNA will be in the range of from about 0.05 g/kg body weight to about 50 mg/kg 
body weight. Preferably the dosage will be from about 0.005 mg/kg to about 20 

10 mg/kg and more preferably from about 0.05 mg/kg to about 5 mg/kg. Of course, as 
the artisan of ordinary skill will appreciate, this dosage will vary according to the 
tissue site of injection. The appropriate and effective dosage of nucleic acid sequence 
can readily be determined by those of ordinary skill in the art and may depend on the 
condition being treated and the route of administration. The preferred route of 

15 administration is by the parenteral route, of injection into the interstitial space of 
tissues. However, other parenteral routes may also be used, such as, inhalation of an 
aerosol formulation particularly for delivery to lungs or bronchial tissues, throat or 
mucous membranes of the nose. In addition, naked polynucleotide constructs can be 
delivered to arteries during angioplasty by the catheter used in the procedure. 

20 The dose response effects of injected polynucleotide in muscle in vivo is 

determined as follows. Suitable template DNA for production of mRNA coding for 
polypeptide of the present invention is prepared in accordance with a standard 
recombinant DNA methodology. The template DNA, which may be either circular or 
linear, is either used as naked DNA or complexed with liposomes. The quadriceps 

25 muscles of mice are then injected with various amounts of the template DNA. 

Five to six week old female and male Balb/C mice are anesthetized by 
intraperitoneal injection with 0.3 ml of 2.5% Avertin. A 1.5 cm incision is made on 
the anterior thigh, and the quadriceps muscle is directly visualized. The template 
DNA~is~injectecllir07l"lnl~of carrieFinaT"! cc syringe through a 27 gauge needle over 

30 one minute, approximately 0.5 cm from the distal insertion site of the muscle into the 
knee and about 0.2 cm deep. A suture is placed over the injection site for future 
localization, and the skin is closed with stainless steel clips. 
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After an appropriate incubation time (e.g., 7 days) muscle extracts are 
prepared by excising the entire quadriceps. Every fifth 15 um cross-section of the 
individual quadriceps muscles is histochemically stained for protein expression. A 
time course for protein expression may be done in a similar fashion except that 
quadriceps from different mice are harvested at different times. Persistence of DNA 
in muscle following injection may be determined by Southern blot analysis after 
preparing total cellular DNA and HIRT supernatants from injected and control mice. 
The results of the above experimentation in mice can be use to extrapolate proper 
dosages and other treatment parameters in humans and other animals using naked 
DNA. 

Example 28: Transgenic Animals. 

The polypeptides of the invention can also be expressed in transgenic animals. 
Animals of any species, including, but not limited to, mice, rats, rabbits, hamsters, 
guinea pigs, pigs, micro-pigs, goats, .sheep, cows and non-human primates, e.g., 
baboons, monkeys, and chimpanzees may be used to generate transgenic animals. In a 
specific embodiment, techniques described herein or otherwise known in the art, are 
used to express polypeptides of the invention in humans, as part of a gene therapy 
protocol. 

Any technique known in the art may be used to introduce the transgene (i.e., 
polynucleotides of the invention) into animals to produce the founder lines of 
transgenic animals. Such techniques include, but are not limited to, pronuclear 
microinjection (Paterson et al., Appl. Microbiol. Biotechnol. 40:691-698 (1994); 
Carver et al., Biotechnology (NY) 11:1263-1270 (1993); Wright et al., Biotechnology 
(NY) 9:830-834 (1991); and Hoppe et al., U.S. Pat. No. 4,873,191 (1989)); retrovirus 
mediated gene transfer into germ lines (Van der Putten et al., Proc. Natl. Acad. Sci., 
USA 82:6148-6152 (1985)), blastocysts or embryos; gene targeting in embryonic 
stem cells (Thompson et al., Cell 56:313-321 (1989)); electroporation of cells or 
embryos (Lo7T983, Mol Cell. Biol. 3?1803-1814 (19 83 ^introduction of the 
polynucleotides of the invention using a gene gun (see, e.g., Ulmer et al., Science 
259:1745 (1993); introducing nucleic acid constructs into embryonic pleuripotent 
stem cells and transferring the stem cells back into the blastocyst; and sperm- 
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mediated gene transfer (Lavitrano et ah, Cell 57:717-723 (1989); etc. For a review of 
such techniques, see Gordon, "Transgenic Animals," Inth Rev. Cytol. 115:171-229 
(1989), which is incorporated by reference herein in its entirety. 

Any technique known in the art may be used to produce transgenic clones 
containing polynucleotides of the invention, for example, nuclear transfer into 
enucleated oocytes of nuclei from cultured embryonic, fetal, or adult cells induced to 
quiescence (Campell et al., Nature 380:64-66 (1996); Wilmut et ah, Nature 385:810- 
813 (1997)). 

The present invention provides for transgenic animals that carry the transgene 
in all their cells, as well as animals which carry the transgene in some, but not all their 
cells, i.e., mosaic animals or chimeric. The transgene may be integrated as a single 
transgene or as multiple copies such as in concatamers, e.g., head-to-head tandems or 
head-to-tail tandems. The transgene may also be selectively introduced into and 
activated in a particular cell type by following, for example, the teaching of Lasko et 
ah (Lasko et ah, Proc. Natl. Acad, Sci. USA 89:6232-6236 (1992)). The regulatory 
sequences required for such a cell-type specific activation will depend upon the 
particular cell type of interest, and will be apparent to those of skill in the art. When 
it is desired that the polynucleotide transgene be integrated into the chromosomal site 
of the endogenous gene, gene targeting is preferred. Briefly, when such a technique is 
to be utilized, vectors containing some nucleotide sequences homologous to the 
endogenous gene are designed for the purpose of integrating, via homologous 
recombination with chromosomal sequences, into and disrupting the function of the 
nucleotide sequence of the endogenous gene. The transgene jmay also be selectively 
introduced into a particular cell type, thus inactivating the endogenous gene in only 
that cell type, by following, for example, the teaching of Gu et ah (Gu et al., Science 
265:103-106 (1994)). The regulatory sequences required for such a cell-type specific 
inactivation will depend upon the particular cell type of interest, and will be apparent 
to those of skill in the art. 

Once transgenic animals have been generated, the expression of the 
recombinant gene may be assayed utilizing standard techniques. Initial screening 
may be accomplished by Southern blot analysis or PCR techniques to analyze animal 
tissues to verify that integration of the transgene has taken place. The level of mRNA 



WO 99/47540 



PCT/US99/05804 



282 

expression of the transgene in the tissues of the transgenic animals may also be 
assessed using techniques which include, but are not limited to, Northern blot analysis 
of tissue samples obtained from the animal, in situ hybridization analysis, and reverse 
transcriptase-PCR (rt-PCR). Samples . of transgenic gene-expressing tissue may also 
5 be evaluated immunocytochemically or immunohistochemically using antibodies 
specific for the transgene product. 

Once the founder animals are produced, they may be bred, inbred, outbred, or 
crossbred to produce colonies of the particular animal. Examples of such breeding 
strategies include, but are not limited to: outbreeding of founder animals with more 

10 than one integration site in order to establish separate lines; inbreeding of separate 
lines in order to produce compound transgenics that express the transgene at higher 
levels because of the effects of additive expression of each transgene; crossing of 
heterozygous transgenic animals to produce animals homozygous for a given 
integration site in order to both augment expression and eliminate the need for 

15 screening of animals by DNA analysis; crossing of separate homozygous lines to 
produce compound heterozygous or homozygous lines; and breeding to place the 
transgene on a distinct background that is appropriate for an experimental model of 
interest. 

Transgenic animals of the invention have uses which include, but are not 
20 limited to, animal model systems useful in elaborating the biological function of 
polypeptides of the present invention, studying conditions and/or disorders associated 
with aberrant expression, and in screening for compounds effective in ameliorating 
such conditions and/or disorders. 



25 Example 29: Knock-Out Animals. 

Endogenous gene expression can also be reduced by inactivating or "knocking 
out" the gene and/or its promoter using targeted homologous recombination. (E.g., 
see Smithies et al., Nature 317:230-234 (1985); Thomas & Capecchi, Cell 51:503- 
512 (1987); Thompson et al., Cell 5:313-321 (1989); each of which is incorporated by 
30 reference herein in its entirety). For example, a mutant, non-functional 
polynucleotide of the invention (or a completely unrelated DNA sequence) flanked by 
DNA homologous to the endogenous polynucleotide sequence (either the coding 
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regions or regulatory regions of the gene) can be used, with or without a selectable 
marker and/or a negative selectable marker, to transfect cells that express 
polypeptides of the invention in vivo. In another embodiment, techniques known in 
the art Eire used to generate knockouts in cells that contain, but do not express the gene 
5 of interest. Insertion of the DNA construct, via targeted homologous recombination, 
results in inactivation of the targeted gene. Such approaches are particularly suited in 
research and agricultural fields where modifications to embryonic stem cells can be 
used to generate animal offspring with an inactive targeted gene {e.g., see Thomas & 
Capecchi 1987 and Thompson 1989, supra). However this approach can be routinely 

10 adapted for use in humans provided the recombinant DNA constructs are directly 
administered or targeted to the required site in vivo using appropriate viral vectors that 
will be apparent to those of skill in the art. 

In further embodiments of the invention, cells that are genetically engineered 
to express the polypeptides of the invention, or alternatively, that are genetically 

15 engineered not to .express the polypeptides of the invention (e.g., knockouts) , are 
administered to a patient in vivo. Such cells may be obtained from the patient (i.e., 
animal, including human) or an MHC compatible donor and can include, but are not 
limited to fibroblasts, bone marrow cells, blood cells (e.g. . lymphocytes), adipocytes, 
muscle cells, endothelial cells etc. The cells are genetically engineered in vitro using 

20 recombinant DNA techniques to introduce the coding sequence of polypeptides of the 
invention into the cells, or alternatively, to disrupt the coding sequence and/or 
endogenous regulatory sequence associated with the polypeptides of the invention, 
e.g. , by transduction (using viral vectors, and preferably vectors that integrate the 
transgene into the cell genome) or transfection procedures, including, but not limited 

25 to, the use of plasmids, cosmids, YACs, naked DNA, electroporation, liposomes, etc. 
The coding sequence of the polypeptides of the invention can be placed under the 
control of a strong constitutive or inducible promoter or promoter/enhancer to achieve 
expression, and preferably secretion, of the polypeptides of the invention. The 
engineered cells which express and preferably secrete the polypeptides of the 

30 invention can be introduced into the patient systemically, e.g., in the circulation, or 
intraperitoneally . 
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Alternatively, the cells can be incorporated into a matrix and implanted in the 
body, e^g., genetically engineered fibroblasts can be implanted as part of a skin graft; 
genetically engineered endothelial cells can be implanted as part of a lymphatic or 
vascular graft. (See, for example, Anderson et al. U.S. Patent No. 5,399,349; and 
5 Mulligan & Wilson, U.S. Patent No. 5,460,959 each of which is incorporated by 
reference herein in its entirety). 

When the cells to be administered are non-autologous or non-MHC 
compatible cells, they can be administered using well known techniques which 
prevent the development of a host immune response against the introduced cells. For 
10 example, the cells may be introduced in an encapsulated form which, while allowing 
for an exchange of components with the immediate extracellular environment, does 
not allow the introduced cells to be recognized by the host immune system. 

Transgenic and "knock-out" animals of the invention have uses which include, 
but are not limited to, animal model systems useful in elaborating the biological 
15 function of polypeptides of the present invention, studying conditions and/or disorders 
associated with aberrant expression, and. in screening for compounds effective in 
ameliorating such conditions and/or disorders. 

It will be clear that the invention may be practiced otherwise than as 
20 particularly described in the foregoing description and examples. Numerous 

modifications and variations of the present invention are possible in light of the above 
teachings and, therefore, are within the scope of the appended claims. 

The entire disclosure of each document cited (including patents, patent 
applications, journal articles, abstracts, laboratory manuals, books, or other 
25 disclosures) in the Background of the Invention, Detailed Description, and Examples 
is hereby incorporated herein by reference. Further, the hard copy of the sequence 
listing submitted herewith and the corresponding computer readable form are both 
incor po rated here i n by reference in the ir entireties. 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule \3bb) 



A The indications made below relate to the microorganism referred to in the description 



on page 



180 



, line 



N/A 



B. IDENTIFICATION OFDEPOSIT 



Further deposits are identified on an addi tional sheet | | 



Name of depositary institution American Type Culture Collection 



Address of depositary institution (including postal code and country) 

10801 University Boulevard 
Manassas, Virginia 201 1 0-2209 
United States of America 



Date of deposit 

February 12, 1998 


Accession Number 

209628 


C. ADDITIONAL INDICATIONS ( leave blank if not applicable) This information is continued on an additional sheet | | 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are notfor all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable ) 



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications e.g., "Accession 
Number of Deposit") 



For receiving Office use only 



This sheet was received with the international application 



Authorized officer 



For International Bureau use only 



| | This sheet was received by the International Bureau on: 



Authorized officer 



Form PCT/RO/134 (July 1992) 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRule 13 bis) 



A- The indications made below relate to the microorganism referred to in the description 
on page 152 .line NM 



B. IDENTIFICATIONOFDEPOSIT Further deposits are identified on an additional sheet | | 



Name of depositary institution American Type Culture Collection 



Address of depositary institution (including postal code and country) 

10801 University Boulevard 
Manassas, Virginia 201 10-2209 
United States of America 



Date of deposit 

February 25, 1998 


Accession Number 

209641 


C ADDITIONAL INDICATIONS (leaveblankif not applicable) . This information is continued on an additional sheet | | 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated Stales) 



E. SEPARATE FURNISHING OF INDICATIONS ( leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications e.g., "Accession 
Number of Deposit") 



For receiving Office use only 



V | This sheet was received with the international application 



Authorized officer 



For International Bureau use only 



I [ This sheet was received by the International Bureau on: 



Authorized officer 



Form PCT/RO/134 (July 1992) 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule \3bis) 



A- The indications made below relate to the microorganism referred to in the description 

on page 1§§ .line . 

B. IDE^TTIFICATIONOFDEPOSIT Further deposits are identified on an additional sheet | [ 

Name of depositary institution American Type Culture Collection 



Address of depositary institution (including postal code and country) 

10801 University Boulevard 
Manassas, Virginia 201 1 0-2209 
United States of America 



Date of deposit 

March 4, 1998 


Accession Number 

209651 
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What Is Claimed Is: 

1 . An isolated nucleic acid molecule comprising a polynucleotide having 
a nucleotide sequence at least 95% identical to a sequence selected from the group 
5 consisting of: 

(a) a polynucleotide fragment of SEQ ID NO:X or a polynucleotide fragment 
of the cDNA sequence included in ATCC Deposit No:Z, which is hybridizable to 
SEQ ID NO:X; 

(b) a polynucleotide encoding a polypeptide fragment of SEQ ID NO: Y or a 
10 polypeptide fragment encoded by the cDNA sequence included in ATCC Deposit 

No:Z, which is hybridizable to SEQ ID NO:X; 

(c) a polynucleotide encoding a polypeptide domain of SEQ ID NO: Y or a 
polypeptide domain encoded by the cDNA sequence included in ATCC Deposit 
No:Z, which is hybridizable to SEQ ID NO:X; - 

15 (d) a polynucleotide encoding a polypeptide epitope of SEQ ID NO:Y or a 

polypeptide epitope encoded by the cDNA sequence included in ATCC Deposit 
No:Z, which is hybridizable to SEQ ID NO:X; 

(e) a polynucleotide encoding a polypeptide of SEQ ID NO: Y or the cDNA 
sequence included in ATCC Deposit No:Z, which is hybridizable to SEQ ID NO:X, 

20 having biological activity; 

(f) a polynucleotide which is a variant of SEQ ID NO:X; 

(g) a polynucleotide which is an allelic variant of SEQ ID NO:X; 

(h) a polynucleotide which encodes a species homologue of the SEQ ID 

NO:Y; 

25 (i) a polynucleotide capable of hybridizing under stringent conditions to any 

one of the polynucleotides specified in (a)-(h), wherein said polynucleotide does not 

hybridize u nder str in gent conditio ns-to a nu cleic acid molecule having a nucleotide 

sequence of only A residues or of only T residues. 
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2. The isolated nucleic acid molecule of claim 1, wherein the 
polynucleotide fragment comprises a nucleotide sequence encoding a secreted 
protein. 

3. The isolated nucleic acid molecule of claim 1, wherein the 
polynucleotide fragment comprises a nucleotide sequence encoding the sequence 
identified as SEQ ID NO:Y or the polypeptide encoded by the cDNA sequence 
included in ATCC Deposit No:Z, which is hybridizable to SEQ ID NO:X. 

4. The isolated nucleic acid molecule of claim 1, wherein the 
polynucleotide fragment comprises the entire nucleotide sequence of SEQ ID NO:X 
or the cDNA sequence included in ATCC Deposit No:Z, which is hybridizable to 
SEQ ID NO:X. 

5. The Isolated nucleic acfd molecule of claim 2, wherein the nucleotide 
sequence comprises sequential nucleotide deletions from either the C-terminus or the 
N-terminus. 

6. The isolated nucleic acid molecule of claim 3, wherein the nucleotide 
sequence comprises sequential nucleotide deletions from either the C-terminus or the 
N-terminus. 

7. A recombinant vector comprising the isolated nucleic acid molecule of 
claim 1. 

8. A method of making a recombinant host cell comprising the isolated 
nucleic acid molecule of claim 1. 



9. A recombinant host cell produced by the method of claim 8. 



10. The recombinant host cell of claim 9 comprising vector sequences. 



WO 99/47540 



PCT/US99/05804 



290 

11. An isolated polypeptide comprising an amino acid sequence at least 
95% identical to a sequence selected from the group consisting of: 

(a) a polypeptide fragment of SEQ ID NO:Y or the encoded sequence 
included in ATCC Deposit No:Z; 

(b) a polypeptide fragment of SEQ ID NO:Y or the encoded sequence 
included in ATCC Deposit No:Z, having biological activity; 

(c) a polypeptide domain of SEQ ID NO: Y or the encoded sequence included 
in ATCC Deposit No:Z; 

(d) a polypeptide epitope of SEQ ID NO: Y or the encoded sequence included 
in ATCC Deposit No:Z; 

(e) a secreted form of SEQ ID NO: Y or the encoded sequence included in 
ATCC Deposit No:Z; 

(f) a full length protein of SEQ ID NO: Y or the encoded sequence included in 
ATCC Deposit No:Z; : - " 

(g) a variant of SEQ ID NO: Y; 

(h) an allelic variant of SEQ ID NO:Y; or 

(i) a species homologue of the SEQ ID NO:Y. 

12. The isolated polypeptide of claim 11, wherein the secreted form or the 
full length protein comprises sequential amino acid deletions from either the C- 
terminus or the N-terminus. 

13. "Ah isolated "antibody that binds "specifically" tothe isolated polypeptide 
of claim 1 1 . 

14. A recombinant host cell that expresses the isolated polypeptide of 
claim 11. 



15. A method of making an isolated polypeptide comprising: 
(a) culturing the recombinant host cell of claim 14 under conditions such that 
said polypeptide is expressed; and 
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(b) recovering said polypeptide. 

16. The polypeptide produced by claim 15. 

17. A method for preventing, treating, or ameliorating a medical condition, 
comprising administering to a mammalian subject a therapeutically effective amount 
of the polypeptide of claim 1 1 or the polynucleotide of claim 1. 

18. A method of diagnosing a pathological condition or a susceptibility to 
a pathological condition in a subject comprising: 

(a) determining the presence or absence of a mutation in the polynucleotide of 
claim 1; and 

(b) diagnosing a pathological condition or a susceptibility to a pathological 
condition based on the presence or absence of said mutation. 

19. A method of diagnosing a pathological condition or a susceptibility to 
a pathological condition in a subject comprising: 

(a) determining the presence or amount of expression of the polypeptide of 
claim 1 1 in a biological sample; and 

(b) diagnosing a pathological condition or a susceptibility to a pathological 
condition based on the presence or amount of expression of the polypeptide. 

20. A method for identifying a binding partner to the pblypeptide"of clainT 
1 1 comprising: 

(a) contacting the polypeptide of claim 1 1 with a binding partner; and 

(b) determining whether the binding partner effects an activity of the 
polypeptide. 



21. The gene corresponding to the cDNA sequence of SEQ ID NO:Y. 



WO 99/47540 



PCT/US99/05804 



292 

22. A method of identifying an activity in a biological assay, wherein the 
method comprises: 

(a) expressing SEQ ID NO:X in a cell; 

(b) isolating the supernatant; 

5 (c) detecting an activity in a biological assay; and 

(d) identifying the protein in the supernatant having the activity. 

23. The product produced by the method of claim 20. 
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<110> Human Genome Sciences, Inc., et al . 

<120> 95 Human Secreted Proteins 

<130> PZ027PCT 

<140> Unassigned 
<141> 1999, March 18 

<150> 60/078,566 
<151> 1998-03-19 

<150> 60/078,574 
<151> 1998-03-19 

<150> 60/078,576 
<151> 1998-03-19 

<150> 60/078,563 
<151> 1998-03-19 

<150> 60/078,573 
<151> 1998-03-19 

<150> 60/078,578 -~ ... _■- " -■ 

<151> 1998-03-19 - - - . 

<150> 60/078,579 
<151> 1998-03-19 

<150> 60/078,581 
<151> 1998-03-19 

<150> 60/078,577 
<151> 1998-03-19 

<150> 60/080,314 
<151> 1998-04-01 

<150> 60"/080,312 
<151> 1998-04-01 

<150> 60/080,313 
<151> 1998-04-01 

<160> 392 

-<170>-PatentIn_Ver— 2-0 * 

<210> 1 

<211> 733 

<212> DNA 

<213> Homo sapiens 

<400> 1 
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gggatccgga gcccaaatct tctgacaaaa ctcacacatg cccaccgtgc ccagcacctg 60 

aattcgaggg tgcaccgtca gtcttcctct tccccccaaa acccaaggac accctcatga 120 

tctcccggac tcctgaggtc acatgcgtgg tggtggacgt aagccacgaa gaccctgagg 180 

tcaagttcaa ctggtacgtg gacggcgtgg aggtgcataa tgccaagaca aagccgcggg 240 

aggagcagta caacagcacg taccgtgtgg tcagcgtcct caccgtcctg caccaggact 300 

ggctgaatgg caaggagtac aagtgcaagg tctccaacaa agccctccca acccccatcg 360 

agaaaaccat ctccaaagcc aaagggcagc cccgagaacc acaggtgtac accctgcccc 420 

catcccggga tgagctgacc aagaaccagg tcagcctgac ctgcctggtc aaaggcttct 480 

atccaagcga catcgccgtg gagtgggaga gcaatgggca gccggagaac aactacaaga 540 

ccacgcctcc cgtgctggac tccgacggct ccttcttcct ctacagcaag ctcaccgtgg 600 

acaagagcag gtggcagcag gggaacgtct tctcatgctc cgtgatgcat gaggctctgc 660 

acaaccacta cacgcagaag agcctctccc tgtctccggg taaatgagtg cgacggccgc 720 

gactctagag gat 733 



<210> 2 

<211> 5 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> Site 
<222> (3) 

<223> Xaa equals any of the twenty naturally ocurring L-amino acids 

<400> 2 - ' 

Trp Ser Xaa Trp Ser 
1 5 

<210> 3 

<211> 86 

<212> DNA 

<213> Homo sapiens 

<400> 3 

gcgcctcgag atttccccga aatctagatt tccccgaaat gatttccccg aaatgatttc 60 
cccgaaatat ctgccatctc aattag 86 

<210> 4 

<211> 27 
<212> DNA 
<213> Homo sapiens 

<400> 4 

gcggcaagct ttttgcaaag cctaggc 27 



^210 > 5 
<211> 271 
<212> DNA 

<213> Homo sapiens 
<400> 5 

ctcgagattt ccccgaaatc tagatttccc cgaaatgatt tccccgaaat gatttccccg 60 
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aaatatctgc catctcaatt agtcagcaac catagtcccg cccctaactc cgcccatccc 120 

gcccctaact ccgcccagtt ccgcccattc tccgccccat ggctgactaa ttttttttat 180 

ttatgcagag gccgaggccg cctcggcctc tgagctattc cagaagtagt gaggaggctt 240 

ttttggaggc ctaggctttt gcaaaaagct t 271 



<210> 6 
<211> 32 
<212> DNA 

<213> Homo sapiens 
<400> 6 

gcgctcgagg gatgacagcg atagaacccc gg 3 2 



<210> 7 

<211> 31 

<212> DNA 

<213> Homo sapiens 

<400> 7 

gcgaagcttc gcgactcccc ggatccgcct c 31 



"<210> 8 . ' " "~- " .. 

<211> 12 ' ■ i - --- 

<212> DNA 

<213> Homo sapiens 
<400> 8 

ggggactttc cc 12 



<210> 9 

<211> 73 

<212> DNA 

<213> Homo sapiens 

<400> 9 

gcggcctcga -ggggactttc ccggggactt -tccggggact ttccgggact ttccatcctg "60 
ccatctcaat tag 73 



<210> 10 

<211> 256 

<212> DNA 

<213> Homo sapiens 



-<400> 10 

ctcgagggga ctttcccggg gactttccgg ggactttccg ggactttcca tctgccatct 60 

caattagtca gcaaccatag tcccgcccct aactccgccc atcccgcccc taactccgcc 120 

cagttccgcc cattctccgc cccatggctg actaattttt tttatttatg cagaggccga 180 

ggccgcctcg gcctctgagc tattccagaa gtagtgagga ggcttttttg gaggcctagg 240 

cttttgcaaa aagctt 256 



RN«>nor.irv ..-wn oad7.«VdnAi i * 
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<210> 11 
<211> 2343 
<212> DNA 

<213> Homo sapiens 



<400> 11 

acgcgtccgg tttttcaaag gtttaactgt ccagggcaga tacttaagac tatctgatca 60 

tccattaaaa acttttcaca tagtcttgct taaatggatc cattatgttt acccattata 120 

ttgttctcag ctgtagtttt aagaaattta tttcatttgt taatacttac tttccattac 180 

cttccccttt tctgtgacaa tccgttgata cttgaagacc tctcttgtat tcatcttagg 240 

gttaatattt ttaaggccaa acagcctaaa ttctatggta atcaactcca gccttgtgta 300 

atgaaatett ctgcataaag ataggtttaa atcaaatcag attgcagatt ttattgaaga 360 

aattgtgttt ttaagagttg acaaatatat gttgtatggc taaaacaaag aaaatacttc 420 

tgttgcttct gcatttagta gaagaaaaac tatatatgtt tgtgaccaaa gtataaaata 480 

tgattctttc cagggaggta aaggttatgc acaagatttt cactagcagc tctaaaaggc 540 

taccctcaat taattgccat gaacatttca tagccctaga aggatgtagg ctcatttcag 600 

tgtcatcctg gtttattctt tattgtatta ttcagcagtc attttaacac tatgctagac 660 

actttagaga ttcagaagag taacagggtt tctgttctca tgaagcttat caggagacag 720 

aaaacatatg aattagatct aattggaggc aaactgaaat atatagtgga gttagtgtgg 780 

ttatcagcac ataaatgagt gatccatcaa caaaaggaga aattgggagg gttttatggg 840 

ccaaaaacag catgattaaa tgtgatagag tatatgtcat gttttaggtg tgatgaacat 900 

tcagttatgt gtgacgaata ggataattga aaaaatatga aaggctatga tgccagaaag 960. 

tattatggga caagatctta aaaccagtgt tacctaggga gtatgaattt aatatgggaa 102 0 

ttcttaaact cctttatgac tggaaga.tga gcatcagagt gtctgcgacc attttgatga .1080 

tatgatgtac cagtttttaa atgtttggct ttttccaggt gatgaaagcg ggggatgagt 1140 

taagaaccac tgctgtgaag gattcacaac tatttttagg cagttgggta aaaatgacca 1200 

atttagtttt aagaaactga ctgtggctcc agagtatgtt ggagaagtga aaatggagac 1260 

taggaataac aggtgggaga ctattagtct aattaagatg taattataaa tctaagctag 1320 

gaacgtaaaa tgagaatgca aagtaagaaa caaatatggg gaaaattata tgtaaaagta .1380 

ataggacttg gcatcttact gatgtgattg attatgagaa aaatgaagca tgtggaggag 1440 

tccactggac agtaggaaat tcagcctaag acttgggtaa gagttctgtg gagttgtgaa 1500 

ttcagaggcc agagatgtga tatttaaaat tttggttcaa gatttcccag gtataagaaa 1560 

gcaagaggat taaagcattg taattaaact ttaagcagtg catatttatg ttatagataa 1620 

gataaacaag aaatctaggg atcaaatagg attaaaatta gtagtgatca ttcagtacag 1680 

tagttacgta ctgttattca caagagtata taaatcaaat tacaaggaat taaggatata 1740 

aacgtgataa gaaagtatgc actgtactct ttgaggaagt ttgccataga aaggaagaag 1800 

aaataggatg gtagatcaga agtaaagcag gacccagtgg ggggagtgtt tgcagtgagg 1860 

cagtatgtat aatcatttaa aacatgggtt tggagtcctc tcaggttcca tgtttgtaat 1920 

"ggacataatg a taataatcc ctttcat^ttaT'aggctgttgt" gagga'ttaaaT tgtgt taatg 1 9~80 ~ 

tgcaaataac tttacacagt gcctggtata taataaatgc ttgctaccta ttaactagta 2040 

tttgtttcta aggctaattt aagtcctaga attgattgca aggattagat caggagtata 2100 

gtggacatgt tgggatttaa atatttaaat atagagatgc tttttaggac cattgttaga 2160 

accagaagag attttttacc aagttcacac agaaatgtag gtgcattggc tgggcatggt 2220 

ggctcacacc tgcagtccca gcacttggga aggctgaggc agaagaactg cttgaggcca 2280 

acattttgag accagcctgg gcaacatatt aagaccccgt ctccaccaaa aaaaaaaaaa 2340 

aaa 2343 



<210> 12 

<211> 1177 

<212> DNA 

<213> Homo sapiens 

<220> 
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<221> SITE 
<222> (1095) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1115) 

<223> n equals a,t,g, or c 



<220> 

<221> SITE 
<222> (1142) 

<223> n equals a, t , g, or c 



<220> 

<221> SITE 
<222> (1162) 

<223> n equals a,t,g, or c 



<400> 12 

agccaccatg cccggcctag attaaaaatt tgaagacata ttctctacta tgagccaatg 60 

aaattactca ttttgtttct atcccatttg ctgtcccttg cttttggaat tttgtgtctt 120 

agtgtgactg tgattctttc tctccttttg tctttcagca aacggggatt cagcgtccga 180 

tcctttggaa cagggactca cgtgaagctt ccaggaccag ctcccgacaa gcccaatgtt 240 

tatg'atttca aaaccacata tgaccagatg- tacaatgatc ttcttaggaa agacaaagaa 300 

ctctatacac agaatgggat tttacatatg ctggacagaa ataagagaat caagccccgg 3 60 

ccagaaagat tccagaactg caaagacctg tttgatctga tcctcacttg cgaagagaga 420 

gtgtatgacc aggtggtgga agatctgaat tccagagaac aggagacctg ccagccygtg 480 

cacgtggtca atgtggacat ccaggacaac cacgaggagg ccaccctggg ggcgtttctc 540 

atctgtgagc tctgccagtg tatccagcac acggaagaca tggagaacga gatcgacgag 60 0 

ctgctgcagg agttcgagga gaagagtggc cgcacctttc tgcacaccgt ctgcttctac 660 

tgagcccagc gcccgcatgg agccgcctct ggagcttcct gttgttcata ctttttcctt 720 

cctgacattt gtttttactt acaggtgttc tgctggtgac ggtagcatta cccaaataaa 780 

ctgtgcatat gaaatgggag aggagatgcc aaaacgccag atgaaagcaa tcaagtttct 840 

tcttttccac ttttacttat gagcrggata ttgattacaa agtttttctt ctttaaccaa 900 

aaaggaaaga caacggtttg tgtgcacttc ccgacatacc tgtgtcttcg tgtgcctgcc 960 

ttccctccct cctccccacc gggccggact gtacagagcc ctgctgcggc gtgttaggaa 1020 

tgacctggaa ttgtcaataa acagatgctg ctgtcaaaaa aaaaaaaaaa aaaaaaaaaa 1080 

aaaaaaaaaa raaancaaaa aaaaaaaaaa aaggnggggc cgaaggtttt ttccctttgg 1140 

._ -tnggggf tat __ f tffggctt'g - gnaft'ggcct r "t"C"gt't'tt 1 - 1 - 77 _.. 



<210> 13 

<211> 2107 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (149) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (487) 
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<223> n equals a,t,g, or c 



<400> 13 

tttaggtatg catataaaag aaaacaaaat atttaaaaca cttaaaggag atctgacgaa 60 

acctaaagag aagaaaaata ataaaattaa gtaaagaaaw ggtatggcag gattttatgt 120 

ttgcctgtgc cctttatcca actgatcang gcttcctgga wtcagtgaca gcagaagttg 180 

cctaccagat caagagactg aaatctcatc cttctatcat catatggagt ggcaataatg 240 

aaaatgagga ggcgctgatg atgaattggt atcatatcag tttcactgac cggccaatct 3 00 

acatcaagga ctatgtgaca ctctatgtga aaaacatcag agagctcgta ctggcaggag 3 60 

acaagagtcg tccttttatt acgtccagtc ctacaaatgg ggctgaaact gttgcagaag 420 

cctgggtctc tcaaaaccct aatagcaatt attttggtga tgtacatttt tatgactata 480 

tcagtgnatt gctggaactg gaaagttttc ccaaaagctc gatttgcatc tgaatatgga 540 

tatcagtcct ggccgtcctt cagtacatta gaaaaggtct cgtctacaga ggactggtct 600 

ttcaatagca agttttcact tcatcgacaa catcacgaag gtggtaacaa acaaatgctt 660 

tatcaggctg gacttcattt caaactcccc caaagcacag atccattacg cacatttaaa 720 

gataccatct accttactca ggtgatgcag gcccagtgtg tcaaaacaga aactgaattc 780 

taccgccgta gtcgcagcga gatagtggat cagcaagggc acacgatggg ggcactttat 840 

tggcagttga atgacatctg gcaagctcct tcctgggctt ctcttgagta cggaggaaag 900 

tggaaaatgc ttcattactt tgctcagaat ttctttgctc cactgttgcc agtaggcttt 960 

gagaatgaaa acaygttcta tatctatggt gtgtcagatc ttcactcgga ttattcgatg 1020 

acactcagtg tgagagtcca tacatggagc tccctggagc ccgtgtgctc tcgtgtgact 1080 

gaacgttttg tgatgaaagg aggagaggct gtctgccttt atgaggagcc agtgtctgaa 1140 

ttgctgagga gatgtgggaa ttgcacacgg gaaagctgtg tggtttcctt ttacctttca 1200 

gctgaccatg aactcctgag cccgaccaac taccacttcy tgtcctcacc gaaggaggcc 1260 

gtgg'ggctct gcaaggcgca gatcactgcc- atca.tctctc agcaaggtga catatttgtt 1320 

tttgacctgg agacctcagc tgtcgctccc tttgtttggt tggatgtagg aagcatccca 1380 

gggagattta gtgacaatgg tttcctcatg actgagaaga cacgaactat attattttac 1440 

ccttgggagc ccaccagcaa gaatgagttg gagcaatctt ttcatgtgac ctccttaaca 1500 

gatatttact gaaggaatct aggttgtatt ttcagtggac aatgggaata aagcatttct 1560 

aaagcaccga ctggagagga aggcaacaga gacaaggaga gaagccgaga gacatgtctg 1620 

cgtgctgcca cgcatctgag cgattgctct gtgaagagtt gtacactgaa cattttcagg 1680 

ggaggctgtt tacccaggca atgtcctcaa acaagcctgt gccggggtgt cctggaatct 1740 

gtgccaggac tgtgttttta gcccttcacc tctcagcttt agcaggacat gaaccagtta 1800 

taacaagatg sccctgcagc tggttacaag aatgtgacat ggcaggatct atggaaccaa 1860 

atggaaggtt ttgaggtgat gtaggtcttt cacagttagc tttggggaat acagaatact 192 0 

caaataaagt gctttgttat tatttcagag ggaatggcga ttgaaatgtt acaacagaga 1980 

tttcttggtg gtagctattt gggtaaaggt atatggatat ttttctgtac atgtgaaatt 2040 

atataaaaat aaaagttata taaattacat tgaaaaaaaa aaaaaaaaaa aaaaaaaggg 2100 

cggccgc 2107 



<210> 14 
<211> 1262 
<212> DNA 

<213> Homo sapiens 



<400> 14 

cctaatggcc cgasctgaat acttgaagga gctcaagatg agggaatctc gctgggaagc 60^ 

tgacaccc tg gacaaagagg gactgtcgga atctgttcgtf agctcttgca cccttcagtg 120 

accctagaag aatgattgga cagatgtgag ccatctggag cagaggggca ctaacccagg 180 

ctgacgccaa gaatgaagtg gcccactgca gccctggcga gcaggcttct tggatggaca 240 

gtgctgagac ccccatatcc cagagtcccc agcctccctc aggttactct gcaccccaca 300 

gatggtttga tggctgtgct gtatactgga ggggagggca ggactctggg agaacagcac 360 

ttctttcatg agacctttgt tactcggtgg ttactgggtc ctgtgcctgt ccgttttggg 420 

gcatgcagcc ctctatcatt tttggctccg agaagagggc aaggggcccc cgcaggtarc 480 
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ttctgtgctt gccctcgccc tgccagcagg cagctgtgcc cctggcctgc ccttcccggg 540 

accccttatt ccaactcagc tcctctttgc actggaatgg ggcactccaa cacccctcag 600 

ggaccaccct ccccacagta tgcactcagc cccacagaac ccaccagtct ttctgggaac 660 

tcacacctgc ccgccatctt ggtactttag gttaatccct caagcatgaa agctggatct 720 

tttggggttt aagaagccca agccttgttc ctgccctggc ctagggagca ctcaggaggg 780 

ttccttggtc ctcatctctc ccacctccgt tccctctggg ccccacacta gccacagcgc 840 

gggccttgtg ctggagtttg agcctgggac agggagaggg aggcttggag acagtctgac 900 

ccagtgccct ctaggccacc cacttctagg cctgccctgc cgccgtggag ccctgggcaa 960 

gctctttccc ctttctgggc ctgggtctcc ccatctcttc aatggggctg ataccttcac 1020 

agcccacagc atgggcactt atgaggacaa agtgaattta acctggaaaa gaatgtattt 1080 

gagagtttct tttaaataat cagcgggtgt tggtgatttg tagcccttct gcccttaaat 1140 

gcttccttgg gcaagagctg tctgtcctcc ctgcaggagg ctgagtgtga agagtatcat 1200 

tcattgtttc tctattaaat tattttctgc taaaaaaaaa aaaaaaaaat ttctgcggtc 1260 

eg 1262 



<210> 15 
<211> 759 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (16) 

<223> n equals a,t,g, ore 
<220> 

<221> SITE 
<222> (22) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (36) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (51) 
— <223>- n equals a, t,g, - or c — 

<220> 

<221> SITE 
<222> (52) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 

<222>— (57) 

<223> n equals a t t,g t or c 

<220> 

<221> SITE 

<222> (58) 

<223> n equals a,t,g, or c 



WO 99/47540 



PCTYUS99/05804 



8 



<400> 15 



ggattaacaa 
aaaaagctta 
ggaattcccg 
ggtcaccagc 
ccctatcaag 
cgcccgtgtg 
gaagccgaaa 
tccaggcacc 
gcccgaccat 
ccggttgtgg 
catctaccac 
caggctgttg 
ggccgggaaa 



attttncaca 
tttttaggtt 
ggtcgaaccc 
ctggtggttg 
atgcaagtca 
gtggagcctc 
ctcttgacca 
aaggcctgga 
gacagcctgt 
gtgatgccaa 
ccccagtagg 
ggactgggac 



cnaggaaaac aggttnttga cccaattagg nnttttnnca 

gacacttatt agaagttacg ccttgcaggt taccggttcc 

caaggggttc gcggacccca gacatgagga ggctcctcct 

.tgctgctgtg ggaggcaggt gcagtcccag cacccaaggt 

aacactggcc ctcagagcag gacccagaga aggcctgggg 

cggagaagga cgaccagctg gtggtgctgt tccctgtcca 

ccgaggagaa gccacgaggt cagggcaggg gccccatcct 

tggagaccga ggacaccctg ggccgtgtcc tgagtcccga 

accaccctcc gcctgaggag gaccagggcg aggagaggcc 

atcaccaggt gctcctggga ccggaggaag accaagacca 

gctccagggg ccatcactgc ccccgccctg tcccaaggcc 

cctccctacc ctgccccagc tagacaaata aaccccagca 

aaaaaaaaag ggcggccgc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
759 



aaaaaaaaaa 



<210> 16 

<211> 1810 

<212> DNA 

<213> Homo sapiens 

<400> 16 

cacgagggtg tgcgtgctta ggcaggaacc cagttttact ttatgccatg tggaaagttt 60 

ctttttccag tatcaccagt gagttcactg tctctccact ggtctgcagt . gctgcttctg 120 

ttacttgcag acttcccacg tgtgcatgga tctccacctg gggtbtctag ggtctctatt 180 

ctacactgcc tatttccctt tctgtcctaa caccatagca tttaactcac ccgtcatcct 240 

gtgttgctga gaatttcctt catagaactc atcaaagtat gattaactgt gctccctgag 300 

ggcaggaatt atgccatctg gatcaccagc ctctcccttg tccttagcac gccatctgca 360 

aattagcaga tactcggtaa atgtgtatta actcgaagta tattttgtgt cttctctgtg 420 

cacagcactg ccctgggaag aactaggatg aggtattgac ttgctgttgc cacataacaa 480 

accctgccag aactccctgg atggaagtga ccaccgtgta tctgtggatt gtctgcaggg 540 

ctctgctggg gtcagcaggt cccacaacag agccagggct cggtctcctc atggctgtca 600 

gaggtttacg tattccgcct cctcccacca aagtctgaag ttgttgtatt ccattccttg 660 

ctatatccac atcttttaat aatgctaaaa tcccgtgttt ctctaaagca ttggattgaa 720 

ccaactgaag aaggaccacg tgtgttgctg ggcctgcttg ggcacaagcc gtttccgatc 780 

caagtcaact gctggtctgc ttagacgaag gtgtgtgggt gtctccacca cggagaggag 840 

ggacagcagg tgagaccata ggccaggaag gaagggcaca gcctaagcgt gcagtggctt 900 

agccagagac cctcgtgcac cagccttcca ggtgcttatt ggaacttatg tcagcccagg 960 

ccatatccaa gtgtgtgatg Tctcggagca"tatatgccag gccagccgga gaggcttagc 1020 

cctgccctgg tggagctgga gggccgcagg gccgcccggt ggggtcagga ggttgtgaag 1080 

aggatcctga tacaggctgg gcctccctgc aggcgtgagc cccggagcac ggggtgagca 1140 

gctccaccca gaggggcttg caggaccaag ctgggacagc aaccaccagg ccctggggca 1200 

gatcagtgag cgtccaggag atgcagatgc agaagacagc caaattcatt cacctctgcg 1260 

tgggcctgtg agggcccaca gagatgcatt ttcattcacg accaggattt cctcggccgg 1320 

agcagccgct tttcccagcc gaagctcact gtgtttacta cataggatgt gagtgtatag 1380 

aaagactctc tctaacgtta gtacgcgtgc agaaatgtgg ggccgcttac aagtgtgggc 1440 

agccgcagcc tgt t cc tcac ccctgtccta acgggacata ctccacgc at gca cattta g 1500 

gatcaccgtg tcttctcgtt ggactgatct gtcattagga ccctggaccc aagtaattgt 1560 

ctttgctctg aagttttgac agtaacaaag gcattccagc tctttctttt tcactcctgt 1620 

cggtgtaacg tgccgttttt catcctttga cttttagccc gcctgtgccc tgtctgaagg 1680 

gagttgtctg tggacagtca cggagtggtg ggtgtttgta atccactctg ccagcctcag 1740 

tcttctaact gttgcgtatg gaccaattac atctgccctt tctcttccct gctaaaaaaa 1800 

aaaaaaaaaa 1810 
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<210> 17 

<211> 1052 

<212> DNA 

<213> Homo sapiens 



<400> 17 

gcaattttct gcatagcatc agcaatgagt ctgtacaact gtcttgctgc actaattcat 60 

aagataccat atggacaatg cacgattgca tgtcgtggca aaaacatgga agtgagactt 120 

atttttctct ctggactgtg catagcagta gctgttgttt gggctgtgtt tcgaaatgaa 180 

gacaggtggg cttggatttt acaggatatc ttggggattg ctttctgtct gaatttaatt 240 

aaaacactga agttgcccaa cttcaagtca tgtgtgatac ttctaggcct tctcctcctc 300 

tatgatgtat tttttgtttt cataacacca ttcatcacaa agaatggtga gagtatcatg 360 

gttgaactcg cagctggacc ttttggaaat aatgaaaagt tgccagtagt catcagagta 420 

ccaaaactga tctatttctc agtaatgagt gtgtgcctca tgcctgtttc aatattgggt 480 

tttggagaca ttattgtacc aggcctgttg attgcatact gtagaagatt tgatgttcag 540 

actggttctt "cttacatata ctatgtttcg tctacagttg cctatgctat tggcatgata 600 

cttacatttg ttgttctggt gctgatgaaa aaggggcaac ctgctctcct ctatttagta 660 

ccttgcacac ttattactgc ctcagttgtt gcctggagac gtaaggaaat gaaaaagttc 72 0 

tggaaaggta acagctatca gatgatggac catttggatt gtgcaacaaa tgaagaaaac 780 

cctgtgatat ctggtgaaca gattgtccag caataatatt atgtggaact gctataatgt -840 

gtcattgatt ttctacaaat agacttcgac tttttaaatt gacttttgaa ttgacaatct 900 

gaaagagtct tcaatgatat gcttgcaaaa atatattttt atgagctggt actgacagtt 960 

acatcataaa taactaaaac gctttgcttt taatgttaaa gttgtgcctt cacattaaat 102 0 

aaaacatatg gtctgtgtar tttcaaaaaa - a_a 1052 



<210> 18 

<211> 1130 

<212> DNA 

<213> Homo sapiens 



<400> 18 

ggcacgaggc catttgtata attctttagt aaattgtatt aatgggagaa tctgtaagtt 60 

atgtctgaac tttcaggttg tcttataatt gtctttttcc ttatgtcaga tgttctatgt 120 

cataagaata aaatggttca caccaataca agtacttagt tgtggaaagg gagagtagaa 180 

gataaaaatg gagattttcc tgtgctacag gcttagtcaa gcttatggtc tatttaatgg 240 

ttatcaaagg caattaaata gtgttgaatg ttctgctttt acctacattt catttttcat 3 00 

gtacttagtt acaaattgaa ccctcttcta tttttttcct gctcctgttt ctgtttcatt 360 

ttagttttcc ttttccctga ttatcattta ggcatgtaag tgacacccag tagcattgct ~ 42~0~ 

ttaattctgc tggtgacagt gccaaagctt tactatactc tttttgttgt ctgttgcttt 480 

tctcttgcta atttgcttga ctagataact aagaattcag gtaagcatta gctctttgtt 540 

cactgagaat aatacaactt gcaagataat taatttggat tgttctacat gtatttcgtt 600 

tatttctctt taccttgttc atttattacg acattttgaa ttatttacat acccatattt 660 

cttctttctt ttatggctca gctcactatg ctttttttta atactggtag cttcctcaag 720 

gttggaaaac aagatctgaa tactatagaa aataataact atttttctgt ggtcatatta 780 

aagatataat ggctttggat tttggggtga tttttctact gtcagtttaa aaaaaacttg 840 

Jtctattjtgc,a_tjtt^ 900 

ttacttattt tgtttccatg tctttttcca aaagaactta ttttttatat tataataaat 960 

atcagtggaa aagtaggttt cgttatatag aaattaactt taggctgggt gcagtggctc 102 0 . 

aagcctatat ttgggaggcc gaggcaggag gattgcttga actcaggagt tcgaaactag 1080 

cgtgggcaat gtagcgagac ctggtctcta caaaaaaaaa aaaaaaaaaa 1130 



<210> 19 
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<211> 883 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (19) 

<223> n equals a ( t,g, or c 



<400> 19 

gtcaccgtgg gcgtttaant atgatccccg gctcagattc gcagactgca ctgaacttcg 60 

gctctacgtt gatgaagaag aagtctgatc ctgagggtcc cgcgctgctc ttccctgaga 12 0 

gtgaactttc catccggata ggtagagctg ggcttctttc agacaagagt gagaatggtg 180 

aggcatatca gagaaagaag gcggcagcca ctggcctccc agagggtcct gctgtccctg 240 

tgccttctcg agggaatctg gcacagcccg gcggcagcag ctggaggagg atcgcactgc 300 

tcatcttggc catcactata cacaacgttc cagagggtct cgctgttgga gttggatttg 360 

gggctataga aaagacggca tctgctacct ttgagagtgc caggaatttg gccattggaa 420 

tcgggatcca gaatttcccc gagggcctgg ctgtcagcct tcccttgcga ggggcaggct 4 80 

tctccacctg gagagctttc tggtatgggc agctgagcgg catggtggag cccctggccg 540 

gggtctttgg tgcctttgcc gtggtgctgg ctgagcccat cctgccctac gctctggcct 600 

ttgctgccgg tgccatggtc tacgtggtca tggacgacat catccccgaa gcccagatca 660 

gtggtaatgg gaaactggca tcctgggcct ccatcctggg atttgtagtg atgatgtcac 720 

tggacgttgg cctgggctag ggctgagacg cttcggaccc cgggaaaggc catacgaaga 780 

aacagcagtg gttggcttct atgggacaac aagcttcttt cttcacatta aaactttttt 840 

ccktcctctc ttcttcaaaa aaaaaaaaaa" aaaaaaactc gag 883 



<210> 20 

<211> 989 

<212> DNA 

<213> Homo sapiens 



<400> 20 

ctggcttggc tgctatactc ttgcccttca ctgaacctca gttttcctca tctgaatagt 60 

tgggagactc attcctgcct ttctcatgtc cctggctatt tggtaaacca gccagtagga 120 

agacatcgtg aaatgtatta aagtggtctt agctagacag agtgggcatg ccagggtcag 180 

cagagattct gaagtctaga ccagttccct gggtgggccg ttgtcagtcc tagcagatgg 240 

ccaggtcagc cctcaggctg gaaattttag ggcagctatt ggtaggtgtc tcctcttgct 300 

gtgctgagat acggtcaaga tcatacttag gcttttgttg gaagaacata caagacgaga 360 

gaaaaaaaaa gatcatactt aggggctccc ggaatttgct ctgccctagg ttgctgagac" "420 

ctctagaacc tgtgcaggct aaaggaactc agtcggtaga tccgagagag gtggtcaggg 480 

agaccaggag catgtctaca ctgccagcag acttttgcct cctcccccaa gccagcagga 540 

tggcccaaaa aggctccccc agcagatcat ctttgcagct ccttttttag ctccagtggc 600 

agcagggatg aggaagggaa agttctatca tttttttcta atttaaaatg acatttaaaa 660 

tcactagcct agtgggggcc aggtgtggtg gctcatgcct gttgtcccag cactttggga 720 

ggccaagacg agtggatcgc ttgagctcag gagttaaaga ccagtctggg caacatagcg 780 

aaacgccgtc tctataaaaa aatacaaaaa ttagctggat gtggtggtgc acacctgtat 840 

_t_cttagctg<L _t t g .t .ggggc t._aaggcgaaag_ga tcacjb t_g,a_g.c,ccaqqagg_t caaggc tqt 900 

agtgagctgt ttgtgccact gcactctagc ctgggtgaca aagcaaaacc ctgtctcaaa 960 

aaaaaaaaaa aaaaaaaaag ggcggccgc 989 



<210> 21 
<211> 495 
<212> DNA 
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<213> Homo sapiens 



<400> 21 

ggtggaatgt 

ttccgacatt 

agaactacag 

cctgatgact 

cgactcaggc 

cagatgcccc 

gtggtgctca 

cccaggagtt 

aaaaaaaaac 



agtgaaaacg 
tgctgactgg 
gaaaggagca 
gtgagcctct 
tcatgtccca 
tgcaggccat 
cacctgtgat 
ggagaccaac 
tcgag 



agatgctgtc 
tcgatgctag 
gtatctctga 
tcctcctgct 
attcaaggca 
ctctccaggc 
cccaccactt 
gtaggcaata 



tctwagggac 
atgatcttga 
attatcgtgt 
tgccacctct 
gcaagaaggc 
tcaggaacct 
tgggaggctg 
cagcaagact 



taaggaagct 
gaccctgtgc 
ggaaggtcac 
cagtcacaag 
catggagcgg 
aaagaagaat 
aggcaagagg 
cccatctcta 



atctttcctc 
tgaggactgg 
ttgtctagcc 
acggctgctg 
cacctgccag 
ctacccagat 
atcacttgag 
caaaaaaaaa 



60 
120 
180 
240 
300 
360 
420 
480 
495 



<210> 22 
<211> 2317 
<212> DNA 

<213> Homo sapiens 



<400> 22 

ccctaaagag 

taaaatggac 

aagtgaatta 

ggtaaggaga 

taattcaaag 

tggaaaaagt 

acgtatatgt 

tgtgtcttgt 

tgggccgcag 

tcagcttttc 

gatagaattt 

tgtaaggtaa 

aactctttga 

tgtcgctcag 

gttcaaggga 

aagcctagct 

ctggtctaca 

ttacaggtgt 

-atgttatt.ta.. 
ttagaggttt 
cagtggcgcg 
ctcagcctcc 
catttttagt 
cgtgattcgc 
cggccggtga 
tggggaaaac 
aaatgaagat 

— actetgtcga- 
cggacatttt 
atcaaggcag 
aatgaggagg 
gaaacccaaa 
gagcgtgtct 
cctgtttaaa 
ttgggaggcc 



ttgaagaact 
tggatttttg 
caaccttaaa 
actgtaatgt 
cattgat-tta 
atgaccccag 
gcatgttacc 
atttgcatat 
tttggattta 
ctctcacaga 
gagggggaaa 
cgggaccagc 
atcaagtcat 
gctggagtgc 
ttctcctgcc 
attttttttg 
actcctgact 
gagccaccgt 
tt.ta.ttcat.g_ 
tttttttttt 
atctcggctc 
tgagtggctg 
ataggtgggg 
ccgcttcggc 
tagagttttg 
cacaaagagg 
gggacatgtt 
-ttgttttaea 
acaggtgaaa 
cccgggacca 
agatccagtg 
agtaagagtg 
accctcgtag 
aagagggctg 
gaggcgggcg 



aattggtcgg 
gtagttttgg 
tttgagattt 
ttaggat.tct 
attccacgta 
tgtcggagat 
tttacgtaca 
attagagtat 
tgtggtatgt 
ttaatctgcc 
ataacacacc 
ccaattatat 
aattttattt 
agtggcccaa 
tcagcctcct 
tattttttag 
tcaggtgatt 
gcccgacctg 
^tcttcaatag 
ttttgagacc 
actgcaagct 
ggactacagg 
tttcaccgtg 
ttcctaaagt 
aacaagacaa 
gtgacaagat 
aagtaataat 
-tataccttgt 
cagggacaca 
gagtccactc 
tggcagaggc 
gcctgagtgg 
gccattggag 
ctgtccgggc 
gatcgcgagg 



taaaaattgg 
ttgcttttaa 
cctttggtga 
gaataagtac 
gtctgttata 
ggctatgtgt 
tgtggaaaaa 
gattttccta 
tgatgaagac 
agtttctccc 
agctaatgat 
aactgattga 
aataattttt 
tcttggctca 
gagtggctgg 
tagagacggg 
caccggcctc 
aatcaagata 
gtatttacat 
gagtcttgct 
ccgcctcccg 
cgcccgccac 
ttagccagga 
gctgggatta 
agctccttgc 
aatttcagat 
aatagatgac 
-gacaacccta 
ggaaaagtaa 
tctgagaaat 
cctggggagg 
cccagagagg 
gcttacatgt 
gcggtggctc 
tcgggagatc 



atattgaatt 
aaaaattagt 
accatggaag 
tgtgttttaa 
ttcagaaaca 
gcatgtatat 
cagttctaat 
atggtcgagg 
ttagtgaata 
actgtgtatt 
gaaacgaact 
ggcattgcca 
ttgaggcaga 
ctgcaacctc 
gattacaggc 
gtttcaccat 
ggcctcccac 
ttatttaaaa 
gtctgtcttc 
ctgttgccca 
aattcacgcc 
cacgcccggc 
tggtgtcgat 
caggcgtgag 
aaagctaacc 
agggctaagg 
atttgaacac 
-agaggtaggc. 
gtaacatgcc 
ggcatttggg 
aatgggctgg 
ttgatgggag 
ggaagggaca 
acgcctgtag 
gagaccatcc 



cataagatgt 
gctagctttc 
tttacccagt 
tcacagctct 
taaaaacaag 
acaaatagac 
taagtcaata 
gcctttttgg 
gccacagtac 
gtgtatatgt 
ggctctagtc 
tttttcactt 
gtttcgctct 
cgcctcccag 
acccgccacc 
gttggccggg 
acagctggga 
gaactgtttg 
ggagatggtg 
ggctggaggg" 
attctcctgc 
taatattttg 
ctcctgacct 
ccaccaagcc 
ttgggaggtt 
gataggaaga 
tgtgccagtc 
_.accg tt atta 
cagctgttga 
caggatcgag 
aagtattcga 
gcaggagtca 
ggttctgatt 
tcccagcact 
tggctaacac 



60 
120 
180 
240 
3 00 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
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cccgtctcta ctaaaaatac aaaaacaaaa ttagcccggc atggtggcgg gcgcctgtgg 2160 

tctcagctgc tcgggaggct gaggcgaaag aatggcgtga acctggaagg cggagcttgc 2220 

agtgagccgg gattgtgcca ctgcactcca gcttgggcga cagagcgaga ctccatctc'c 2280 

aaaaaggaat tcgatatcaa gcttatcgat accgtcg 2317 



<210> 23 

<211> 1726 

<212> DNA 

<213> Homo sapiens 



<400> 23 

ctttttggct ctcattttga atttttcaag agctcatgtt ctttgtcttc attaaaaaaa 60 

aaaagttctt atatcgtgta tgaatgtcat tcgggatatc tatacataca tgcacatacc 120 ■ 

tcatttttat tgcatttcac tttattgcac tttgcaaagt gacatttttt acagattcaa 180 

ggtttggcaa ccctatgtcc ataagtctgt cagcaccatt tttctaacat atgtgctcat 240 

tttgcctctc tgtcacattt tttttttttc ctgagacagg gtcttgctct gtcacccagg 300 

ctggaatgca gtggtgtaat tatgcctcac tgcggccttg acctcttggg ctcaagggat 360 

cctcttgcct gcgcttcttg agtagctgag actacagatg tacaccacca cacacccagc 420 

taagttttaa atttttttat agagatgggg tttccctatg ttgcccaagc tgctctcgaa 480 

ctcctgggct taagtgatcc tcccacctca gcctttcaaa gtgctgggat tacaggcatg 540 

agccacagca cctggtctct gtgtcacgtt ataataattc tgccaatatt ccagatttcg 600 

tcattattaa atctgttatg gtgatctgtg atcagcgaac tctgatgtta ctgtctaatt 660 

gctttgggat gcagtgaacc cgtcagagtc atccatgagg gttggaatca acttcttcca 72 0 

aaatcctgtt aatcaagagt gaacttaatc. gattaatgtt gtgtatgt.tc tgactgetcc 7 80 

accaatctgt gggtccccta tcactctccc tctcctcagg cctccctatt ccctgagaca 840 

caataatatt gaaattagac caattaataa ccctgcaatg tgaaaggaag aagttacatg 900 

tctctcactt taaatcaaaa gctagaaatt attaagctta gtgaggaggg catttcgaaa 960 

gctgagagag gctgaaagct aggccggttg tgccaaatag ctagccaagt tgtgaatgca 1020 

taggaaaagt tcttgaagga aattaaaatt gttactccag tgaacacaca aatgttaagt 1080 

aagcaaaaca gccttattgc ttatggaaag aaagtctgaa tggtctgaat agaagatcac 1140 

atcagccaaa acatgtcctt aagccaaagc ctaatctata atagatcagg ccctaagtct 1200 

cttccattcc ttgaaggcac agagaggtga agaagttgca gaagaaaagt tggaagctag 12 60 

ccaaccttgg tttgtgcagt ttaaggaaaa aagccatctc cataacatgg aagtgcaaga 1320 

tgaagcagca ggcactagtg gggaagctgc agcaagttat acagaaaatc tagctaatga 1380 

tgagggtggc tacactaaaa acagattttc aatggagaca aaacaccctt ctattggaag 1440 

aagatgccct ctaaagcttt cataggtaga gagaggtcag tgcatgggct tgaaagaaca 1500 

ggctgattct cttgctagag gccaatgaag ccagtgagtt taagttgaag ccagtgctaa 1560 

tttatcattc tgaaaattgt agggccctta agcattatgc taaatctact ctgcctgtgc 1620 

tctagaaatg "gaatagcaaa gcacgaatga "cagcacatct gtttacaaca tgatgtgctg 1680 

aatattttaa gcctatattt gagacctact gctcaaaaaa aaaaaa 172 6 



<210> 24 

<211> 529 

<212> DNA 

<213> Homo sapiens 



<400> 24 

acgcgtccga ttacttacgt gctcctggct gggatggcac tgggcattca gaaaaggttc 60 

tccccggagg tgctgggcct gtgtgcaagc acagcgctgg tgtgggtggt gatggaggtg 12 0 

ctggccctgc tcctgggcct ctacctggcc accgtgcgca gtgacctgag cacctttcac 180 

ctgctggcct acagtggcta caaatacgtg ggaatgatcc tcagtgtgct cacggggctg 240 

ctgttcggca gcgatggcta ctacgtggcg ctggcctgga cctcatcggc gctcatgtac 3 00 

ttcattgtgc gctctttgcg gacagcagcc ctgggccccg acagcatggg gggccccgtc 3 60 
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ccccggcagc gtctccagct ctacctgact ctgggagctg cagccttcca gcccctcatc 420 
atatactggc tgactttcca cctggtccgg tgaccccctg gccccagatg gcactgagtt 480 
tttcattcat tgaagatttg atttccttga aaaaaaaaaa aaaaaaaaa 529 



<210> 25 
<211> 1755 
<212> DNA 

<213> Homo sapiens 



<400> 25 

ggcacgagcc tcacagcgcc tctgctggag ttcctgctgg ccttgtactt cctctttgct 60 

gatgccatgc agctgaatga caagtggcag ggcttgtgct ggcccatgat ggacttcctg 120 

cgctgtgtca ccgcggccct catctacttt gctatctcca tcacggccat cgccaagtac 180 

tcggatgggg cttccaaagc cgctgggggg tctgtgcctg acactcgggc tgtttgtcca 240 

agcagatctg aaatgggccg tgagctgggg gcagcagcct cccgggagca gggagtcagc 300 

cctgtgatgc atcccatcca ccctgtccac aggtgtttgg cttctttgct accatcgtgt 360 

ttgcaactgr tttctacctg atctttaacg acgtggccaa attcctcaaa caaggggact 420 

ctgcagatga gaccacagcc cacaagacag aagaagagaa ttccgactcg gactctgact 480 

gaaggcctgc gggtgccttg gcaacctgag ccacacaggc ctccacccct gcgcctcaca 540 

ggggtcgctg gcgttggagc ggaggcctgg acttctgagt tgcagagggg gctgcggaca 600 

cagcaggccc cctacagcct caggttctgc ctgagcccag cctaccaggc ttgcccctca 660 

gctcagcact gttgaccacg ctgcgtatga gggcatcttg ggtatcccac tccttctccc 720 

catttctgtc ccacaggcct tcagcccttt aacgtctctg ccaaaaacca gcacaaggag 780 

acaaagcaga gccttgtfitg tatctgggca" gcaggtgttc catgctgcta .ggtggcgggg 840 

gtcgggggtc ttctgtttca ctaacagg'aa caaagacaga aaccatgaca gggctgcccc 900 

gccaggcccc ggtgggtttg tctgcacttg gtgctcctgc ccacaccagc cactttggtg 960 

acaatgaccc ttccaagaat ctttggttca aggagcacca gttccctctt cattcttgaa 1020 

gcagggagaa attgaccttt gccttgtcgc ccaggaagtg gggctcggca cccataacta 1080 

acacctccca cccttggaaa ccatgtcttc tgggggtgag atgaccattc tgggtctaag 1140 

actgtttcaa agaagagctc atagactgac tggtccagaa gacagagggt acaacagtgg 1200 

catcacagtg acagtgtcat ggggagctgg gcgggcccag ccaaaccctc cttcttccta 1260 

gagcccagcc agcaggcagg agttcctgga ccctcaggac agtgaacttc cagacctcag 1320 

ggcaggtcta tgggccactg caggagatga gaccagcctt ctgtgttcac ctaacgattt 1380 

atactgtgta tctgtctttg atggaatttt gtaacttttt atattttttt atgcaaaagc 1440 

agcttcttaa cagatggcat tttctgtgac tctaggcctc acaaaagagc cagagttctg 1500 

gacccatgtt tggagcattt gtagccttat tctcttgcgt gtgaatctct taccctgaaa 1560 

aaaagccata atgaattaag ccagactgac cacttgcttg gagtgtgtgc ttgaaaaaac 1620 

cagagcaata ctgttgggta ttgtatcagg cttcagtaca aactggtaac accaatgtgg 1680 

atcctgacag ctttcagttt tagcaaaaat acacgtgaaa tctgactacc" atttaaaaaa 1740 

aaaaaaaaaa aaaaa 1755 



<210> 26 
<211> 1751 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> SITE 
<222> (1520) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
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<222> (1557) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1689) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1729) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1735) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1741) 

<223> n equals a,t,g, or c 



<400> 26 

gggtgcagcc tgatggcgca ggaggtagac acgg.cacagg gcgccgagat . gcggcggggc .... . 60 

gcgggcgcgg ctcggggacg cgcttcctgg tgctgggccc tggcgctgct ttggctcgcg 120 

gtggttccgg gctggtcccg ggtctcgggc atcccctccc ggcgccactg gccggtgccc 180 

tacaagcgct ttgacttccg tccaaaacct gatccttatt gtcaagctaa gtatactttc 240 

tgtccaactg gctcacctat cccagttatg gagggtgatg atgacattga agtttttcga 300 

ttacaagccc cagtatggga atttaaatat ggagacctcc tgggacactt gaaaattatg 360 

catgatgcca ttggattcag aagtacatta actggcaaga actacacaat ggaatggtat 420 

gaacttttcc aacttggcaa ctgtacattt ccccatctcc gacctgaaat ggatgcccct 480 

ttctggtgta atcaaggcgc tgcctgcttt tttgagggaa ttgatgatgt tcactggaag 540 

gaaaatggga cattagttca agtagcaact atatcaggaa acatgttcaa ccaaatggca 600 

aagtgggtga aacaggacaa tgaaacagga atttattatg agacatggaa tgtaaaagcc 660 

agcccagaaa agggggcaga gacatggttt gattcctacg actgttccaa atttgtgtta 720 

aggaccttta acaagttggc tgaatttgga gcagagttca agaacataga amccaactat 780 

acargaatat ttctttacag tggagaacct acttatctgg gaaatgaaac atctgttttt 840 

gggccaacag gaaacaagac tcttggttta gccataaaaa gattttatta ccccttcaaa 900 

^"cacatttgc~c~aactaaaga^tttctgtt^ tgcagtgattf 960 

gtgcacaaac agttctattt gttttataat tttgaatatt ggtttttacc tatgaaattc 1020 

ccttttatta aaataacata tgaagaaatc cctttaccta tcagaaacaa aacactctct 1080 

ggtttataaa acaccttaat tctactgctc ttttttctcc aatcaccagc atctgttttt 1140 

cagggggtga ttttactttt gtgaattcct tagcctttct tccttggtgc ataaagttaa 1200 

aatgcacatc agcagaattg ctgcatatta acatctcagg actcttctct tgtaaagaag 1260 

ctgaaattcg tactatattg gccaaagtga gcgagttagg tgatcttggt ttcaatttcc 1320 

gagcctttgt taatatggag aattatggtt catatcagtt atgtaggacc tttggaccca 1380 

gggtcct ac a gat a gatat g gtgt gcccag attttaaaaa taccttcaaa a ataa aaaat 1440 

acattcagtg acattttcat ggtgggagct cttctttctg atatggcagt tacacttttt 1500 

cacttaagtg ctttagtttn agactaactt tacaacttct ataacttttg ggaaccnagt 1560 

ttagtatagt ctgattacat tccattcacc taactttagg cattcggttt agacaccata 1620 

actggrgkgr atkgkgcytc cyagratgtg ggcaaatccc agtggttaac accatatttc 1680 

tgggctggng attttgggga ctagctaggt aaacgggctt ggtggttcnt ttaancatac 1740 

ntaaccacca c 1751 
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<210> 27 

<211> 1212 

<212> DNA 

<213> Homo sapiens 



<400> 27 

gccaagcttg gcacgargtt ggtggcggcg tccggaggtg ctggtttgtt ctcggtgaac 60 

ggcgcgcggg gtctctcctg agtgcgagct acgggacctt cgccatgccg gggatggtac 12 0 

tcttcggccg gcgctgggcc atcgccagcg acgacttggt cttcccaggg ttcttcgagc 180 

tggtcgtgcg agtgctgtgg tggattggca ttctgacgtt gtatctcatg cacagaggaa 240 

agctggactg tgctggtgga gccttgctca gcagttactt gatcgtcctc atgattctcc 300 

tggcagttgt catatgtact gtgtcagcca tcatgtgtgt cagcatgaga ggaacgattt 360 

gtaaccctgg accgcggaag tctatgtcta agctgcttta catccgcctg gcgctgtttt 420 

ttccagagat ggtctgggcc tctctggggg ctgcctgggt ggcagatggt gttcagtgcg 480 

acaggacagt tgtaaacggc atcatcgcaa ccgtcgtggt cagttggatc atcatcgctg 540 

ccacagtggt ttccattatc attgtctttg accctcttgg ggggaaaatg gctccatatt 600 

cctctgccgg ccccagccac ctggatagtc atgattcaag ccagttactt aatggcctca 660 

agacagcagc tacaagcgtg tgggaaacca . gaatcaagct cttgtgctgt tgcattggga 720 

aagacgacca tactcgggtt gcttyttcga gtacggcaga gcttttctca acctactttt 7 80 

cagacacaga tctggtgccc agcgacattg cggcgggcct cgccctgctt catcagcaac 840 

aggacaatat caggaacaac caagacctgc ccaggtggtc tgccatgccc cagggagctc 900 

ccaggaagct gatctggatg cagaattaga aaactgccat cattacatgc agtttgcagc 960 

agcggcctat gggtggsccc tctacatcta cagaaacccc ctcacggggc tgtgcaggay 1020 

tggtggtgac tgaaattagc tggacatgg.t tgcacacacc tgtaatcaca. gctactcggg 1080 

aggttgaggc gggagaatcg cttgaaccag ggagttggag gttgcagtga gtggagatca 1140 

caccattgcc ctgcagccta agcaacagag caagattctg tctcaaaaaa aaaaaaaaaa 12 00 

aaaaaactcg ag 1212 



<210> 28 

<211> 1112 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 

<222> (1105) 

<223> n equals a,t,g, 



<400> 28 

ggcacgagca aacatccagg agtgtgcacc ggtcatgcaa ggtgttttgt ttggctttgt 60 

ctggcttttt agttttttgt ggcaggagaa taaatctagt gcctctccct ccacattagc 120 

caaaagtgga agtccctgtc cagtcagcat tccttggatg cctggtgtat tagtccgttt 180 

tttcacactg ctataataaa aagaactgcc caagactggg taactaataa aggaaagagg 240 

tttaattgac tcacacttct gcatgtttgg gaggcctcag gaaagttaca atcaggcaga 300 

aggtgaagtg cgttcgtctt aatggcggca ggtgagacag tgtgtaggat aaactgtcaa 360 

acac 1 1 ataa aac catcata gctcatga^a cctcattcac tgtcacg a ga acagcatgg g 420 

ggaaccgccc ccatgatcta atcacctccc actaggtccc tccctccacc tgtggggatt 480 

atgaggatta caattcaaga tgagatttgg gcaggggcac cgagccaaac catatcacct 540 

tatatgtgcc cagtgttgac ctaggcgctg ggatgcagaa acaaacacga catgggctgt 600 

gccttgggga gctcacactc ttgctggaga agcatgctga ttcctaaata agaaatgcta 660 

tgtgctgtgt acagagtacc atggaaggca ggatgaactc tttgggagga agaagcaagg 720 

aaagctttag agagttgctg gcttttgagg gatggagcag gcattttcta catggggaaa 780 

gtgtaggaaa gagtattcca ggcagagtgg agagcaagag caaaggcgga gaagcctgtg 840 



or c 
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ctgcgaattc cttgccgggc aggatccctg tcttactgct gtttagagat caatatatgt 900 

caagtgactg gaagtgtggt ttttgttctg ggactagtag gtagaacaga aagagttggg 960 

atggagtgag caacccatgg agaaatagag gctcggggtc agctgataca aggcgttgta 1020 

taccaagctg aggagcacaa gatttggaac ataataccaa atgctgggga gccatgggag 1080 

ggccatggga gctctgatag tgttntctcg ag 1112 



<210> 29 
<211> 748. 
<212> DNA 

<213> Homo sapiens 



<400> 29 

ggcacgagcg aaactgtttt ccaatgtggc tgaaccactc tgcatttcca ccagtaatga 60 

gaatgagagt tgctgttgct ccacggcctc accagcattt ggtggtgtca gtgtcttgga 120 

ttttagccat cctaataagt gttagtggct atcattgttt tcatttgcaa ttctcttaca 180 

tggtgtlcgaa catctttccc catgtttatt tgtcatctgc atatcttctt cggccagtta 240 

tctgttcaga tcttttgccc gtttttgttt gcttgcatgt ttgtttgtgt ttgatttttt 300 

aaagaaagct ttttttatta ttgagttgta atagtgcttg tatagtgtgg ataacagttc 360 

tctatcagat aggtcttttg caaatatttt ccccaatctg tggactgtct tctcattctt 420 

ttgataaatg gctttaaaat aataatctgg ccgggcgcag tggctcatgc ctgtaattcc 480 

agcactttgg gaggccaagg gcagatcatc tgaggtcggg agttcgagac cagcctgacc 540 

aacatggaga aaccccatct ctactaaaaa tataaaatta gtcgggcgtg gaggcacatg 600 

cctgtaatcc cagctacttg agaggctgag acaggagaat ctcttgaacc cgggaggtgg 660 

aggttgcagt gagccgaaat cgtgccactg tattccagcc tggacaataa, gagcaaaact 720 

ccatctcaaa aaaaaaaaaa aactcgag 748 



<210> 30 
<211> 778 
<212> DNA 

<213> Homo sapiens 
<400> 30 

ggaactaaaa agctttgtgt tcttcagggt gggtggcagg gggatatagt gagggtggac 
cagggagaat gaccataggg cactaagtaa ggctgggatt ggatcagcag aaatccaacc 
ctctaacctt agggtaggga gtgctaagga tctggggaaa ccatgggctg ggaagctgct 
cttgctctcc tcgtgtctgc tgtttttttc ccttggtgta ctatacagag gccagatgtt 
ggcaccacct ctccaggagg attggaaagg aggagtaaag gattctgatt tgattgatga 
ttccagtgca tccccaatcc "caccatctta~cctggaatat~aaggctgcct tgtacccctt 
ttctgagcac aagtctgtgc gtaatgcaac tgactctctt acttttttct tagtaactga 
tcatttccta gacaaccaag attctcaata agtcccagtc tcatcacaaa tattaatatt 
tccttttcct cataccaact tgactatgtt tcactgaaac ccacaggtct tgggacagaa 
tgaggcatta cctcattgaa ctttagctgc ctgcatgagt cctctgtcct caagtctttc 
tcagatcatt tctcaagctg gctcccagct tagggcaaag agaatctcca tgatgtgctg 
acttctagct tgccacagac acaattctac tccaaagtca gcctggcata gtaacattga 
tgtcagggga gacatatcag tttgaggcca tacaaaaaaa aaaaaaaaaa aactcgag 



<210> 31 
<211> 1324 
<212> DNA 

< 2 1 3 > Homo sap i ens 



60 
120 
180 
240 
300 
- 3 60" 
420 
480 
540 
600 
660 
720 
778 



<400> 31 
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acgagctaat 
ggtgaaggat 
cagtagggtc 
gattcgggag 
atacttaagt 
gctaaaatcc 
aagatgggaa 
aggtaaaatt 
ttcaaaggta 
taaagtaagc 
ccccctcagt 
taggggaaca 
tgacttttta 
caagagcctg 
gtaactcatt 
cagtagtagt 
tctttaaaac 
ctattgttat 
actttaaatc 
ataggaagaa 
aaatattaaa 
tataatcctt 
aaaa 



gattcttgct 
aaggaacctg 
cttgctaaaa 
actgactaaa 
ttttcttttt 
atgctattca 
agtatttctc 
atttttacat 
attgtaaagt 
attgattctt 
gtcccctccc 
ccacttaaaa 
agatactgca 
gagccaaact 
caactaggga 
atattaacca 
agtgcctggt 
tcaaagaagg 
cttgaggtta 
aacccttgcc 
tacattttgt 
cgaatcccac 



gaagatggcc 
atcagatatc 
ctggttttta 
gtttcatcaa 
atgcggtttt 
tgcacacaat 
tctcattccc 
ttgaaattat 
ttcattgtat 
tggctgtcag 
aaaaaatatg 
ataaagaggt 
gggttgagag 
gagattaaat 
acttaaccta 
tatagagttt 
catcacttaa 
ttatttgagt 
atgattcttt 
ccaatagaaa 
atttctttta 
aattttcatt 



agagtaatca 
aagggtgggg 
caagagagag 
gaatctttgt 
gttttaaaca 
ggctggatcc 
acaggatttg 
gttttacatt 
aaaatctggt 
atagcactac 
ccctacaaca 
aggtggtaat 
gcaagtctag 
cttagtcctg 
actgtttcct 
ttgtgaggat 
gtgttgaggt 
tatatttttt 
ggaaaagctg 
ataatacaac 
ttccattttg 
cgacactacc 



gagtaattaa 
gtactcttgc 
cacaggtagg 
caggagacat 
ggctaatagc 
ccaaatctaa 
cagatgaaga 
gctttgctct 
tttctttctt 
agaaataact 
gcaaggggca 
ggtgaagcaa 
tattatatag 
ctagttaaca 
tatctataaa 
tgagatagta 
agctgctgtt 
ccctaggctc 
gaggtgtgct 
tagaacataa 
tttgttcatg 
tttaaaaaaa 



tttggggaat 


60 


taaactgact 


120 


tctacaagaa 


180 


gacctttatg 


240 


tcgtcagctt 


300 


tctttggcta 


360 


cattaataaa 


420 


atgatagggt 


480 


tgcattgaag 


540 


gcctctccat 


600 


gaagtggaag 


660 


ggatttttct 


720 


aataagagca 


780 


t ttctgttgt 


840 


atggaaatta 


900 


tatgtaaagt 


960 


ttaaaaatta 


1020 


tccaaggtat 


1080 


gtggtaaata 


1140 


aacacaatta 


1200 


atttaagttt 


1260 


aaaaaaaaaa 


1320 




1324 



<210> 32 

<211> 739 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (732) 

<223> n equals a,t,g, or c 



<400> 32 
ggcacgagga 
tagattgcag 
gcctgtcact 
"catttttcag" 
tctcattttt 
gtgcttactt 
ggtttacaca 
ggaggaaaaa 
tattttttaa 
tgggttgtgt 
aggcaaagtc 
aacctctgcc 



caggatcctg 

ggcatttgtt 

gctaactcct 

ttgatagttt" 

attttgctgg 

ttggagtttt 

gtaaacaatg 

aagagatata 

gcctagaggg 

taagcctaac 

tcgccctgtc 

tcctgaggca 



gtttgggtac 
tgagactatt 
tagtattaaa 
atatactttc" 
attgttttct 
gattccctgt 
tgaatgtgat 
aaggtaatca 
aactctttgt 
cctaacttct 
acccaggctg 
ggagaatcac 



cttagtttaa 
tagccacagc 
actgtcaaac 
tctgaaggat 
gttttttgct 
gtcactgttt 
caccaaaata 
ccaccaccct 
tggctctgtt 
wctctctctc 
gagtgcagtg 
ttgaacctgg 



tagaaaccca 
agggcaagca 
atgggaggta 
cctaatgata" 
tcagcattct 
tctttcgcat 
cgcacagaac 
cccacctcct 
aagtttaggg 
tctctttttt 
gcacgacctt 
caggcggagg 



ggtggaaacc 
ggaagatgca 
actgctgatg 
"gttaaccatt" 
tgcttttgct 
acacctctca 
atctgaccga 
gttttgttgt 
ttaatgtgat 
tttttttttg 
ggctcactgc 
ttttggtgag 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



ctgaggtcgt gncattgca 



739 



<210> 33 

<211> 1462 

<212> DNA 

<213> Homo sapiens 
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<400> 33 

ggccatcggc ggggcagtcg cgggatgcgc ccgggagcca cagcctgagc tttagcccat 60 

gaggaggatg tgaccgggac tgagtcagga gccctctgga agcatggaga ctgtggtgat 120 

tgttgccata ggtgtgctgg ccaccatctt tctggcttcg tttgcagcct tggtgctggt 180 

ttgcaggcag cgctactgcc ggccgcgaga cctgctgcag cgctatgatt ctaagcccat 240 

tgtggacctc attggtgcca tggagaccca gtctgagccc tctgagttag aactggacga . 300 

tgtcgttatc accaaccccc acattgaggc cattctggag aatgaagact ggatcgaaga 3 60 

tgcctcgggt ctcatgtccc actgcattgc catcttgaag atttgtcaca ctctgacaga 420 

gaagcttgtt gccatgacaa tgggctctgg ggccaagatg aagacttcag ccagtgtcag 480 

cgacatcatt gtggtggcca agcggatcag ccccagggtg gatgatgttg tgaagtcgat 540 

gtaccctccg ttggacccca aactcctgga cgcacggacg actgccctgc tcctgtctgt 600 

cagtcacctg gtgctggtga caaggaatgc ctgccatctg acgggaggcc tggactggat 660 

tgaccagtct ctgtcggctg ctgaggagca tttggaagtc cttcgagaag cagccctagc 72 0 

ttctgagcca gataaaggcc tcccaggccc tgaaggcttc ctgcaggagc agtctgcaat 780 

ttagtgccta caggccagca gctagccatg aaggcccctg ccgccatccc tggatggctc 840 

agcttagcct tctacttttt cctatagagt tagttgttct ccayggctgg agagttcagc 900 

tgtgtgtgca tagtaaagca ggagatcccc gtcagtttat gcctcttttg cagttgcaaa 960 

ctgtggctgg tgagtggcag tctaatacta cagttagggg agatgccatt cactctctgc 1020 

aagaggagta ttgaaaactg gtggactgtc agctttattt agctcaccta gtgttttcaa 1080 

gaaaattgag ccaccgtcta agaaatcaag aggtttcaca ttaaaattag aatttctggc 1140 

ctctctcgat cggtcagaat gtgtggcaat tctgatctgc attttcagaa gaggacaatc 1200 

aattgaaact aagtaggggt ttcttctttt ggcaagactt gtactctctc acctggcctg 1260 

tttcatttat ttgtattatc tgcctggtcc ctgaggcgtc tgggtctctc ctctcccttg 1320 

caggtttggg tttgaagctg aggaactaca aagttgatga tttctttttt . atctttatgc 1380 

ctgcaatttt acctagctac cactaggtgg atagtaaatt tatabttatg tttccctcaa 1440 

aaaaaaaaaa aaaaaactcg ag 1462 



<210> 34 
<211> 2815 
<212> DNA 

<213> Homo sapiens 



<400> 34 

gggtcctgga gtgccctcgg ctgatagaga ctatagttcg agagttcttg cccaccagtt 60 

ggtctcctgt gggggcaggg cctaccccta gtctatacaa agtaccctgt gctactgcca 120 

tgaaactact tcgtgtcctg gcctcagctg ggaggaatat tgctgcccgg ctgttgagca 180 

gctttgatct ccggagccgc ctgtgccgca tcatagctga ggctccccaa gaactggcct 240 

"tgcccccaga " ggaagctgag atgctgagca _ ccgaggccct — ccgtctgtgg - gctgtggctg 300 

cctcctatgg ccagggcggt tacctttaca gggagctcta cccagtgctg atgcgggcct 360 

tgcaggtggt gccgcgggag ctcagcaccc acccacctca acccctgtcc atgcagcgga 420 

tagcctcact gctcactctc ctcacccagc taaccctggc agccggcagt acccctgctg 480 

aaaccatcag tgattctgct gaggccagcc tctcggccac cccttcctta gtcacttgga 540 

cacaggtgtc tgggctccag cctcttgttg agccgtgtct aaggcagacc ttgaagttgc 600 

tgtccagacc tgagatgtgg agagccgtgg gcccagtgcc ' cgttgcctgc ctgttgttcc 660 

tgggagccta ctaccaggcc tggagccagc aaccaagctc atgcccggag gattggctcc 72 0 

^g.g^catgga_gcgcctgtca^ gagagctgcrt gctgc cactg ctg agtca gc ccacact ggg 780 

cagcctgtgg gattccctta ggcactgctc ccttctctgc aacccgctgt cctgtgtgcc 840 

agcccttgaa gctcccccca gcctcgtgtc actgggctgc tcgggaggct gcccccgtct 900 

cagtctggct ggctcagcct cacccttccc attcctcact gccctcctct ctcttcttaa 960 

taccctggcc cagatccaca aggggctgtg tggccagctg gctgccatat tggctgcccc 1020 

gggactccag aattacttcc tccagtgtgt ggctcctggg gctgccccac acctcacacc 1080 

tttctctgca tgggccctgc gccatgagta ccacctgcag tacctggcac tcgctctggc 1140 

ccagaaagcg gcagcgctgc agccactgcc agccacccat gctgccctct atcatggtat 1200 
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ggccttggcc ctgctgagcc ggctgctgcc cggaagtgag tacctcaccc atgagctgct 1260 

gctgagctgt gtattccggc tggagttcct cccggaaaga acatcagggg gtccagaggc 1320 

agccgacttc tctgaccagc tgtcgttagg aagcagcaga gtccctcggt gtgggcaagg 1380 

gactctgctg gctcaggcct gccaggacct ccccagcatc cgcaactgct acctgactca 1440 

ttgctcgcca gcccgagcca gtctgctggc ctcccaggct ctgcaccgag gggagctaca 1500 

gcgagtccca accctgctac tgcccatgcc tacggagccg ctgctgccca ccgactggcc 1560 

cttcctgcac tgattcgcct ctacaccggg cttcagacac cccctcggga ctctctccac 1620 

agacaccatg ggcacagcca tgcgggtcct gcagtgggtg ctagttttgg agagctggcg 1680 

cccccaggct ctctgggctg tgccccctgc tgcccgcctg gcacggctca tgtgtgtgtt 1740 

cctggtggac agtgagctgt tccgggagtc cccagtacag catctggtgg cagccctcct 1800 

cgcccagctc tgtcagcctc aagtcttgcc aaacctcaac ctggactgcc gactccctgg 1860 

cctgacgtct ttccctgacc tctatgccaa cttcctggat cattttgagg ctgtctcttt 1920 

tggggaccac ctctttgggg ccctggtcct cctgcccctg cagcgtcggt tcagtgtcac 1980 

cttgcgcctt gccctctttg gggaacacgt gggagccttg cgagctctga gcctgcctct 2040 

gacccagttg cctgtgtccc tggagtgtta cacagtgcct cctgaagaca acctggccct 2100 

ccttcagctc tacttccgga ccctggttac tggtgcgctc cgcccacgtt ggtgccccgt 2160 

gctatatgct gtggctgtgg ctcatgtcaa tagcttcatc ttctctcagg acccacagag 2220 

ctcagatgag gtcaaagctg cccgcaggag tatgctgcag aaaacatggc tgctggcaga 22 80 

tgagggtctc cggcagcacc tcctgcacta taagcttccc aattccacgc tcccagaggg 2340 

ctttgagctc tattctcagt tgccccctct gcgtcagcac tacctccaga gactgacttc 2400 

aacagtgctc caaaatgggg tatcagagac ctaggatagt tgatatagat ggaaagatgg 24 60 

gtacgttgtc ctgtatccag cctttcaaca gatgtctggc cagacgaaga acattgtgtc 2520 

ctaatggtag gcaggagacc aaggagcaga aggcttgcct tcctgggagc aggttgtttg 2580 

agctgtttta gagcagtgag ccctaccatt acatcctgat atctggggct tctgaaggtc 2 640 

tgtgctggga gtgaagagtg gcttagctat ' tjiacccgctc tttggggaca . gggcaaacta 2700 

aatgcatccc ttcttaccta actcccaacc cctgccctgg gctgaggcat atgaatgcta 2760 

tagttgtgca ttaaaataaa tgttttttat ctcctggaaa aaaaaaaaaa aaaaa 2815 



<210> 35 

<211> 1078 

<212> DNA 

<213> Homo sapiens 



<400> 35 

ggtgggctct gtgctgggtg ccttcctcac cttcccaggc ctgcggctgg cccagaccca 60 

ccgggacgca ctgaccatgt cggaggacag acccatgctg cagttcctcc tgcacaccag 12 0 

cttcctgtct cccctgttca tcctgtggct ctggacaaag cccattgcac gggacttcct 180 

gcaccagccg cc 9tttgggg agacgcgttt ctccctgctg tccgattctg ccttcgactc 240 

tgggcgcctc tggttgctgg tggtgctgtg cctgctgcgg ctggcggtga cccggcccca 300" 

cctgcaggcc tacctgtgcc tggccaaggc ccgggtggag cagctgcgaa gggaggctgg 360 

ccgcatcgaa gcccgtgaaa tccagcagag ggtggtccga gtctactgct atgtgaccgt 420 

ggtgagcttg cagtacctga cgccgctcat cctcaccctc aactgcacac ttctgctcaa 480 

gacgctggga ggctattcct ggggcytggg cccagctcct ctactatccc cccgacccat 540 

cctcagccag cgctgccccc atcggctctg gggaggacga agtccagcag actgcagcgc 600 

ggattgccgg ggcyctgggt ggcctgctta ctcccctctt cctccgtggc gtcctggcct 660 

acctcrtctg gtggacggct gcctgccagc tgctcgccag ccttttcggc ctctacttcc 720 

— accagcact.t_ggcaggc.tec_tagc£^ 780 

ctggggcagc gggacactag cctgccccct ctgtttgcgc ccccgtgtcc ccagctgcaa 840 

ggtggggccg gactccccgg cgttcccttc accacagtgc ctgacccgcg gccccccttg 900 

gacgccgagt ttctgcctca gaactgtctc tcctgggccc agcagcatga gggtcccgag 960 

gccattgtct ccgaagcgta tgtgccaggt ttgagtggcr agggtgatgc tggctgctct 102 0 

tctgaacaaa taaaggagca tgccgatttt taaaaaaaaa aaaaaaaaaa aaaaaaaa 107 8 
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<210> .36 

<211> 1217 

<212> DNA 

<213> Homo sapiens 



<400> 36 

cggcacgagg ttgaatgtta gccctggagg agatccatgt cttactcgct ctttctggcc 60 

cttctgtctt ttgcctctgc aattcttttt gtagctggca cgatagcagg gactgggggt 120 

ctatcctttc atggtattgc tacaatattt gtccttactg gaaaatggta acatccgggt 180 

ctgatttaat tggcattaca cttacacagg gactctgagc acccccgtca ccacaccaga 240 

cagtggacca gttttcacag ctacaaagag ctagaaatgt gtttaacatc atccagtgca 300 

tcccctaatt caaaaccatc ctcactaatc aatcatattc acccataaat attacaaatg 360 

agattgattc catctcaaga caatttgtca aatacttaat tttcttcctg gatgattcta 420 

cttactggat attttagaaa gagaaatgtc tgagataaaa tccctcacat ttactcaata 480 

taacaaatta ctgtttctac tcctattctg agtagtgctt ctgaagattg tttgctgtag 540 

tgttgtcttt gataaaatga atgtcagtag tgagcctttt agagatacca tgctcagaaa 600 

tcctctttgg gatcagaaga tacctaaaat tctccccttt tgcceacttg gttagatgag 660 

tgatatattc tttggatcct gcaaagaaga. gattggtttc ttttcttttc tggtggtggt 720 

agtggttgta tctgtggctg tgatggttgt tgttacttgt ctctctctct ctctggctct 780 

ggcttttgct ttcctgctag tgttctttct ctttccaaac aaatagttaa attaaacgtg 840 

agcttctgaa ttgtacttgt tcatactttc aaaacataac agattaataa aaatagatgt 900 

gtcctgattt aaaacatgcc ccctggaaag gcatgctgta ttatgaaatc atgataatat 960 

aactgcatta ttacatggca gtataaatat tagtctgttg aattcatttg tccaattgta 1020 

taactttgtg gagcagtgtt ttgacctttg atacataatt ctggagcaag tggagtggtt 1080 

gcaggcagat gagacagtgt tatatcagga tttttcaatc aactttagtt. ggaggcctgg 1140 

caattacaaa catcttcaga tgtttctgta accattataa atatgaaaaa aacctcttca 1200 

aaaaaaaaaa aaaaaaa 1217 



<210> 37 

<211> 1282 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> SITE 
<222> (153) 

<223> n equals a,t,g, or c 

" <220> 

<221> SITE 

<222> (1220) 

<223> n equals a,t,g, or c 



<220> 

<221> SITE 
<222> (1222) 

<223> n_ equals _a_^ t , g o r c 
<220> 

<221> SITE 
<222> (1232) 

<223> n equals a # t,g, or c 



<220> 



ouerwirv *\*tr\ nt\A-rK.A(\A 1 t * 
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<221> SITE 
<222> (1246) 

<223> n equals a, t,g, or c 
<220> 

<221> SITE 
<222> (1282) 

<223> n equals a,t,g, or c 



<400> 37 

actcgtgccg aattcggcac gagccattct gagtttggtc ccttcccaaa agtaggggtt 60 

ttgtgtggaa aatctgagca aacctctgtt gactgttctg gggtggagtg aagggagaar 12 0 

gggctcagct aaagaacatg gggagattag ggnaacaatg ccttttattt cttgctttta 180 

aagcaatttc aggagttttc ttcctctttt ggcgtcctgc tgactccaca gagcggaaca 240 

cccaaagctg ggactttcca cctctctaat gctcagtgaa gagcgggcca ggggggtgtg 3 00 

gaaaagaaag ggtcctggag gagcccaaat tacgaatggc tagagactgg cattggcaag 360 

cgaggaggct tcgtcacagt gtagtcttcc ggttgtccga gggtactgtc ccaggggctg 420 

gggggtrttc cgtcttctgc agatcaactc ccgcaggcta aatgtggaca tcgcggtatc 480 

atgcttgata aacggaccaa taatcaagtg gagattcatt agaaccacat aacccatact 540 

aggttgattt ctcaagtata agscctggtc tgttgcccag sctggagtgc actgacacca 600 

tcatggttca ctgcagcctc aaactcctga gcccaagtga tycctcccac tcagcctcac 660 

aagtagctaa gactagaggt gtgcaccatc amacccagct aatttttaaa gttttttttg . 720 

tararatggg gtcccactct acaaaatatw taagtataag gcckggtctg ttgcccargc 780 

tgggsaaccc ttggactaag gcaatcctcc agcctcagcc tcctaaagtg ctgggattac 840 

aggcgtgagc caccgcaccc acctctagga tctctactat tgaggaaaaa ttggaggcat 9.00 

caaactccaa gggcaaaaca -tgaagactcg ctggcccacc atggatggag gttttctctc 960 

ttaaaattcc cacagcaccg catggaactg cctctcctgg gacctcagcg tttccttctt 1020 

tgctctaagc aatagcctct gccactggag attctgagat ggccgatttc cttttggata 1080 

tttaagtttt gaaatcatgc tcatttggca taggaatgtt tcacttcagt ctcctttaaa 1140 

caaaaggaca cacaaccacg attgcccctc cctcccgaag ggtcactgga cttcatgcat 1200 

cagtaatgtt tccaaaaatn tnttaagtac cnacatgcag tggccngctt ttcatttttc 1260 

caagtgaagc catcagaaaa an 1282 



<210> 38 
<211> 559 
<212> DNA 

<213> Homo sapiens 



<400> 38 

gattcggcac gagctgaagc cctgggtgcc actgctggcc cagcagggag gaggttgctg 60 

ctgctcgggc tgaagtgagg tgtgggtctg gctgggcctc cagtttccca cctgggcctt 120 

gattgtgagg aaggcctggc ctggctgcag aagcccagaa gcacctgagt aggagagttc 180 

ctttgtccca cctgcagctc attcaagcct gtgcatgggg gttggggtcc tcaggatctt 240 

gctttcctgt ttaggggagg cagccccaaa gagtgctggg accagtttgg agagtgctaa 300 

ggaatgctgg tctgcagcga ccctacttgt gctctgcgtc ctctgccaac tgcagcatgg 360 

gtgaacatct gtacatctgt ccccataatg aaaatggcct cagcaaataa caaaaatatt 420 

_ accatttagc a atca g gcac ttattaaa ag cctg gcccaa t aaacttaa a aaaaaaaaa a 480 

aaaaactcga gggggggccc ggtacccaat tcgccctata gtgagtcgta ttacgcgsgs 540 

tcamtggccg tcgtttaca 559 



<210> 39 
<211> 803 
<212> DNA 
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<213> Homo sapiens 



<400> 39 

ggcagagcta ggccaggcag agcctagctc ttgccagggc agcaggaagc cacacagtgt 60 

gttgaagccg gagcaggaga gggggccctg actcccatgt gtccttgcag gcaggagcag 12 0 

ttcgtggact tgtacaagga gtttgagcca agcctggtca acagcaccgt ctacatcatg 180 

gccatggcca tccagatggc acctttcgcc atcaattaca aagtaaggcc tgggccctgc 240 

craaaacattc actgtctgcc cacccagccc caccccatga agccatctgt ccctcatccc 300 

cacagggccc gcccttcatg gagagcctgc ccgagaacaa gcccctggtg tggagtctgg 3 60 

cagtttcact cctggccatc attggcctgc tcctcggctc ctcgcccgac ttcaacagcc 420 

agtttggcct cgtggacatc cctgtggagt tcaagctggt cattgcccag gtcctgctcc 480 

tggacttctg cctggcgctc ctggccgacc gcgtcctgca gttcttcctg gggaccccga 540 

agctgaaagt gccttcctga gatggcagtg ctggtaccca ctgcccaccc tggctgccgc 600 

tgggcgggaa ccccaacagg gccccgggag ggaaccctgc ccccaacccc ccacagcaag 660 

gctgtacagt ctcgcccttg gaagactgag ctgggacccc cacagccatc cgctggcttg 720 

gccagcagaa ccagccccaa gccagcacct ttggtaaata aagcagcatc tgagatttta 780 

aaaaaaaaaa aaaaaaactc gag 803 



<210> 40 

<211> 1510 

<212> DNA 

< 2 1 3 > Homo sapiens 

<220> _ ; 

<221> SITE ~~ • : • 

<222> (426) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (454) 

<223> n equals a, t,g, ■ or c 
<400> 40 

cacgagaaac attctatctt ttatcaaatg tgtgattcat aacttttgga taccaaagga 
atctaacgaa ataaccataa tcatcaatcc atacagggag actgtgtgct tctctgtgga 
gcctgtcaag aagatattta actatatgat acatgtgaat cgaaacatca tggatttcaa 
actcttcctt gtgtttgtgg caggagtttt tcttttcttt tatgcaagga ccctggagtc 
aaagccctac tttctattac tec tcgggaa" ctgtgctagg tgttctaatg acatagtctt 
tgtcttgctg ttggtgaaaa gattcatccg aagtatagca ccttttgggg ctctaatggt 
tggttgttgg tttgcctcag tttatattgt atgccagttg atggaagatc tgaagtggct 
gtggtntgaa aacaggatat atgtatcagg ctangtcttg atagttggat ttttcagctt 
tgttgtttgt tacaagcatg ggccccttgc acacgacagg agcagaagtc ttctgatgtg 
gatgetgega ctcctctccc tggttctggt ctatgctggt gtggctgtgc ctcagtttgc 
ctatgcagcc ataatcctcc tcatgtcctc ctggagtctg cactacccac tgagagcatg 
cagttatatg aggtggaaaa tggagcagtg gtttacatca aaagagctgg tggtgaaata 
^t^a^ggaa_ga^gagtaca ggga gcaagc tgat gctgaa acga acagtg ctctggagga_ 
gctacgccgg gcctgccgaa aacccgactt tccctcatgg ctggtcgtct ccagactcca 
cactcctagc aaatttgcag attttgttct tggaggaagc cacttgtcac ctgaagaaat 
cagtctgeat gaagagcagt atggccttgg gggtgccttc ttggaagagc agctctttaa 
cccgagtact gectgacatg cgaccttcaa gttgacttca ttctggacaa ggaagtgggc 
aaagggcagg attctattaa agttaggcag aactgttcta gtgaacggtg gcaaaaacat 
ttgctgtgga gaaaaacaag tcagtctgga aaggaaaacc aacccatttt gaagataact 
tagcattctt ggtgacttct gctacttatt gtactgtagg tggataccaa aattctgtga 
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cagccactac cacttacctt gaatgaaggc tttcattagg aacaggggaa tggcgttgtt 12 60 

cttaaggggc tagtaagcat gaacaggtgc tttgtcgaca ccagggcact aaatctggtc 1320 

ttaatcccct gaacctgtgt cagaagactc tgcaatactc ttcctatagt tcgtcagtat 1380 

aagtccttaa agagacctga gacatgctgg accagtgttt tccaaagtac agctcacagg 1440 

ctactaccaa gtgttggtca ataaaggtat tctgaggtca actaagattg ataaaaaaaa 1500 

aaaaaaaaaa 1510 



<210> 41 
<211> 1095 
<212> DNA 

<213> Homo sapiens 



<400> 41 

gcttggtggt gctatttgct tcttcaaatt ctcgttattt aaaatatttc tttcttgtac 60 

cgttgattct gggatcagcc tggatgtgtc aaacactgcc tgccaggctt agagctcagt 120 

gcatttcttc ccttttattc ctgctgatgg gattgctggc catgaccggt gagaggaatc 180 

aaggaaccca ttactatgag ttctcaggat tcatcttcaa atctcaaatg atgtggtcaa 240 

ttaaaccaaa ttaaaaacaa' gctcttgtta aaagcaagtt aaaaacaagc tcttgacctt 3 00 

gagaagaaat gattggtatt aggaagactg ttgagctgat actgcccttc attcattctc 3 60 

taccctggtg cttggataca ggagcaaagt aagaaaataa tcacagcttt attgagggct 42 0 

ctatgagcaa ggcttggtga ggatggaaga gaatggagct atcagttgat gagaacctac 4 80 

taggtgttga gctccttaca ttcattgcct atttaaaact ttctaacaac ttcatgtgta 540 

agcgttgtcc cgatttaaaa aaaaaaatag atgtggaaac tgaacctgga gaaggtgtgt 600 

aatttgtcca aggttgcaca gg.caaagggg caaaattcag ctttaaaccc agga.ctgttt 660 

ccacagctcc aagtyccctt itattcatggg atttgtaaga tggagcccct gccactgtag 720 

catttataac ttactttgga gaataagatt cctgaaagta cgtttaataa aaaaaaaaga 7 80 

tgtccagcta tgtacggcag ctcacgcctg taatcccagc actttgagag gcaaaggggg 840 

gaggatagct tgaggctaag agtttaagac taacctgggc aacatggcaa gaccctgtct 900 

ctaaaaaaca aaattagcca gttgtggtgg catgcacctg tagtccaagc tactcaggag 9 60 

gctaaggtga gagggtcgct tgagcccagg agtttgaggc tgcagtgagc catgatggcg 1020 

ccactgcact ccagtgcaga gtgcaggcta cagaatgaga ccccatcaca aaaaaaaaaa 1080 

aaaaaaaaac tcgag 109 5 



<210> 42 
<211> 1162 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (340) 

<223> n equals a,t,g, or c 



<400> 42 

ggcacgagct gattcctaag gaatattcta gccaaatcat gtatctgtgg tttagttttt 60 
ctag a g ta gg gc tgtgcggt tgctgcctgc tttatagggc^ atgtgggttt atatggtatc " 120 

tgctgttact tgggcacagc agcaccaact cattacagga tggaggggca gaacgcccag 1~8"0 

agcacccctg ggctcacgtg cggtacagct gcaggagaga gctgtccttt tggttttatg 240 

tttttaatta attctgtttc ctcagattga tgattaaatt tatttttcca gcctgaccaa 300 

gaaggcgtca ccataccaga tctggggagt ctctcctcan ctctgataga cacagagagg 3 60 

aatctgggcc tgcttctcgg attacacgct tcctatttag caatgagcac accgctgtct 420 

cctgtcgaga ttgaatgtgc cagtaagaaa atctttactt tttgctaatt agcagatttt 480 

ttttttttgg aactgtaagt gccattaaga gtgggagagg gccaggcaca gtggttcatg 540 
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cctgtaatcc cagcactttg ggaggttgtg gcacgtggat tgcttgagat caagattttg 600 

agaccagcct gggcaacatg gcaaaacccc atctctacaa aaaacacaaa aattagccag 660 

gcatgttggc acgtatttgt agtcccagat actcaggagg ctgaggtagg aggattgctt 720 

gagcctggga ggttgaggct gcagtgagtc atgatcatac cactgcactc cagcctgggt 780 

gacagagcaa gactctctct ttaaaaaagc aggagatggc caggcagtgg ctcatgcctg 840 

taatcccagc actttgggag gctgaggcgg gtggatcacc tgaggtcagg agttcaagac 900 

cagcctggcc aatgtggtga aaccccatgt ctactaaaaa tgcaaaaatt agctgggtgt 960 

ggtgacgggt gcctgtaatc ctaggtactc gggacgctga cgtaggagaa ttgcttgaac 1020 

ccaggacaca gaggttgcag tgagctgaga tcacgccact gcactccagc ctgggtgaca 1080 

agagcgagac tcggtctcca aaaaaaaaaa aggagaggag gattcaacac agttgatgat 1140 

gacaaaaaaa aaaaaaaaaa aa 1162 



<210> 43 
<211> 657 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (12) 

<223> n equals a,t,g, or c 
<400> 43 

cccccccggg gntgcagc[aa ttcggcacar attt.tacatg cttttaagtt aatgttggaa 
aactaatcac aagcagtttc taaaccaaaa aatgacatgt tgtaaaagga caataaacgt 
tgggtcaaaa tggagcctga gtcctgggcc ctgtgcctgc ttcttttcct gggaacagcc 
ttgggctacc caccactccc aaggcattct tccaaatgtg aaatcctgga agtaagattg 
caccttcttc ctctcctgat caacatcggt atgatgtctc ctgttgcctc accctttgtc 
tgcagtatca ctggatagga ctggtggaaa gggagcagcc tgacagagct ccaaatgtgg 
agaatatggc atccctccac ctatatttga tgtggacggt aaggctaggc ctgcaggatc 
ccttatcctg accaaagact gtgttggggt gccatttgaa aatcgcaggg ttgcaaaaga 
atacaatctt acttgcaggt ggatattctc tatactctct tttaatgcat ctaaaaatcc 
caaacatccc ctggttggtg atcacttaca gttgtgtcca cctttatttt atgtactttg 
attaaaaaaa aaaaactttt tgttaatata aaaaaaaaaa aaaaaaaaaa aaaaaaa 



<210> 44 
<211> 1155 
~~<212> DNA 
<213> Homo sapiens 



<400> 44 

ggcacgagtg gaagtgtaag cagaaataca gcgagggctc aggaaatact agaataggca 60 

acatgctctt cctctctgct tctatctgca catctgcttt atttctttgc ctcagcagac 120 

tcaccatctc tgctcctcat cccgcatggt ggggaaggat gcccacccac acctccccag 180 

gccatctgtt agagctccaa ccacgtggaa tgacggaatc cattctgttc tctatctctg 240 

ctctagtt tc aaattcctgg gga aaaatga cc ca gctcac t tcaggctcc cactctt ggt 300 

ccagtgggct gcaaaatttc caagcgtagc ttctgtcagt tccttgcttt gggttaggtg 360 

aaaatgaagg gaataattgt gagctgttca gattcaccaa gaaattatct actattgttg 420 

ggggagaatg cccaggggac agatgcattt gggtaaggga caataacaag acactagaaa 480 

ggaaaatccc aattttattt tcctacagag tcagcatccc acacattttc cttcacagaa 540 

actgacaaat aatccatggg ggcagcttag cagatgggtt gaaaaaagcg acaggctcat 600 

catcagtttt caacaccttg atacatcagg cttggccctt gctacctcat gcattattta 660 

agcacaatgc atctccctct aattgtgtca tgtgctggag gagaatgtga agttctgtct 720 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
657 



Okicrwirv ^khtrx mA *rc >• n a 4 1 ^ 
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gtctttagca aacatgtttc aagtactgtc tgtctgaaaa ccaaatggaa gagggtaaac • 780 

ttgatgatcc acttgatttt agttttagga cctggatgca taggcagatg tcagtttaca 840 

aggattctgt gtactttaag gaatgttttc tgagcatgtc cagtacaaca gacgctctgt 900 

taggtagctg tagttaggat tttttggttg taagtatgtg aagatttaaa tgtatcagct 960 

cacttactca gaaaatctga ggcagtgcta gccaaaccaa atggttcaag caaatgtcat 1020 

cagtatttgg cctcttccag tctttttact cctctatcct ctgtgtctgc ttcacttcta 1080 

cacaagcttt ctctatgtgg tggctccaga ttttatatct tctagtagat atttttttaa 1140 

aaaaaaaaaa aaaaa 1155 



<210> 45 
<211> 1112 
<212> DNA 

<213> Homo sapiens 



<400> 45 

gccggaggaa gagcgtctgc aaaactgggt tcctagaagt atagacggac ttagcttttw 60 

gtagaatttg gtgaggagca gcgcctcgtg agagcagaat ggcctggcgt ggccagtgct 120 

tcccggcagc acgcagctct gcggcctcca gaattcccct gttctgagct tgatgcccct 180 

agcctgtccc ctacctactt cctcccctcc tctctagccc tctcacaggg gtgattgcta 240 

cctctctgtt ttcttgggcc taggcaagtt ttagaggagt tcccaagcat tgttatgagg 300 

ccagtgtgct cgctgggctg ggcgggatgg cctgggcttg tgtgtggcct gagggctctc 3 60 

ctggggcctt ctcttttccc agtcaccttt ggagccacag aagcagtgca ctcattggat 420 

gtctgttctt aacacagctt ctctttctac attaaaaaaa atcattattg cattttggaa 480 

agcagtgctc atcaaaagca acttttaaaa.cctattttat tgttccttta aatgttctct 540 

cccgctgaaa ctgccctgga gaggctatct gctgctcttc catttaccca catcaggtta 600 

ttctccatgt cactcagtgg agatgactcc agatgtgttt aaagmctgga caattcacct 660 

atactgtgta ggaaattacc tccttaatta cctggtmgaa ttgtcagcag acatgttcat 720 

ccgatgatag tactgcagtt ttctattaat aatttgcaga cttttatcta acctgcactc 780 

atgtacagat tattaaaagt tttaaaatgt aactgatcag tattgatcaa tcattgtctt 840 

gatttttttt tacagcgtat atttctaatc atatttttta aagccaagag aactggttga 900 

atgaatgttt attttcctga aggtattttt aagataaagc ttcctaatgg cgtgtaaact 960 

ttgcatatgt atgtagtttg atacatattg tcacatttga aaatcttgtg ggttgtaact 1020 

ggttttatac aaaatatcga atagtggaaa ttgtataatt acaatcatgt aattaaaagt 1080 

attaacccaa aaaaaaaaaa aaaaaaytcg ag 1112 



<210> 46 
<211> 4023 

<212> DNA " : ~ ~ " " - .- — 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (1049) 

<223> n equals a,t,g, or c 

<220> ; 

<221> SITE 
<222> (2758) 

<223> n equals a,t,g, or c 
<400> 46 

cccacgcgtc cgtccaaaca tcaggaggca ggcagcatgg taaatgagaa agaagccagg 60 
actgggagtc caaagtcctg gcttctatgt ctggctttgc tactaatcaa atatgtgact 120 
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ttttgcaaac catacctcac taaaccttac tttcttcatt tgagcgtgtt ggaccagctg 180 
tccccaggaa cccccttgga ttgatctgag aaggcaagga taagtttttc aaaggaagaa 240 
aagaggagta gtcagtccgc agtacagtag acacaagccc caggacatct gagtgtcttt 3 00 

cagcaagaac tctctgtgat atttcactac aatttctctg gcaccttggg actctcctca 360 
gcccttgtgg tggtgggtct tgtttaacta gcagttccct ccattctatg cctgtgaaga 420 
atctatcacc taccatgtga ttacagtgca gatttttttt tccttttcct tttctttttc 480 
tttctttttt tttttttttt tgtttgagac ggagtctcgc tttgtcaccc aggctgcagc 540 
gcagtggcgc gatctcggct cactgcaagc tccgcctccc gggttcaccg ccattctcct 600 
gcctcagcct cccgagtagc tgggactata ggcgcccgcc accgtgcctg gctaattttt 660 
tctattttta gtagagacag ggtttcaccg tgttagccag gatggtctcc atctcctgac 720 
gtggtgatcc gcccgcctcg ggctcccaaa gtgctgggat tacgggcgtg agccactgcg 780 
cccggcctac agtgcggata ttttatgaga gaggagatca caactcagtc cccaagccct 840 
caacccttaa tacatactat cgtatgaaat gcctctttcc aaattcagcc ttttctaaaa 900 
ctcaagatga gaaaactgct gatgaggctc actttctaaa ataccggaat ttgcaatata 960 

gggagaatag tttttcatgt ttctttgttt aagcaataga aagaaaggaa acttatgtcg 1020 

tttacttttc aggccataga ggttttcana acaacttgaa aacatgatca aattagccaa 1080 

acttctgata gttttcaatg tagtctgtga tcatgggata atttagcctc agttcttttt 1140 

ctgaaattgt gttttgaatg tttgatttga cttatttacc atcaaacttg ctataaggtt 1200 

attactctaa tgaataagca tattccctta attgggacaa tttactatta tttctttcat 1260 

aaagtagggc accattcacc atctatttcc tggctcttta gttatcaaaa tgttaagctc 1320 

attgctattc atcccggcac agcacttata tgagaggcat gaagctggct gaattctgca 13 80 

tcattaggaa tgacacagcc tcatcacatt gacaccagtg cttgtctctc acaccaatcc 1440 

aaattaagac caactgaaaa tagtcagagt ttcctctgga gctccttttt gaagagacat 1500 

atgtttttta gtctggtggt acccaaaatt gaacaaaaaa tgggtgctgc ttctcttaat 1560 

aggcaaaact atgctgcagg a.taatgtatt- catgcaggg.t c'ttccagcca gaccccaaat 1620 

catccctccc ttcactagaa ttttfcctgt-t taattcgatg gcca'ctctcc acagggatcc 1680 

attctgtgtc ttattacagg agatgctcaa tgaatgaggg acttatcttc tagaaatgca 1740 

gctccgaggt agtctgttga gtgaaataat gaatccatta tcacaaaata aattgaaagc 1800 

tgtctgacat ttggacaatt tttattttgt ttcacattgt tctgaaaact atactgtttc 1860 

ttttctccct attatttaaa taagcaaatg atgaacagat tacaaaattg aggacactcg 1920 

aggtagggaa ggagcccctc gacaggagga tcaggacata gtaccaaggg caagagaaac 1980 

gattcaataa acactattta ctatatattt taggcatggt tctaggtaat cacatgataa 2040 

gtagttgaaa gaactgaaaa tgttttatct gcaagaaaag ggcaagtgta atatcttcaa 2100 

attttagaaa gaatgtaaat tagaatttga cttaatttgg tgtagttctt gtgggcagaa 2160 

attgaattga ataggctgaa agttataaga aggattttag ctcagtattg atactggatt 2220 

gctcatgggt ggtgagagtt actcatcact ggaagagttc aagcaggggc cataagaaat 22 80 

ctcagggatt ttataaggtg attcatgctc tgggaaaagg atgccttgga ttattgtgtc 2340 

agggtacttc taactctagg attctggttt ctaagatctg gactctagtc ttgccactca 2400 

cctgccatca agaacatgtt cctcatctgc aggacaggac caagatggct ctgtctacct 2460 

— taeegggt-tg- etgtgaggcg— fcgat-tgtgat— aaaatacata- aaggcagttt -ttaagctctg 25-20-- 

aagcactagt taaatgtgta gcgtatttta agattctgtt gtatgtacaa ttgtttagca 2580 

gtctctctct ctttctttct ctttctttta tcagagatag atgattttcc ctcttatttc 2640 

caccagtttg gcttttcagg gaaggtggca gctggcagaa tcccctgaca acaaaaggta 2700 

cagcaaaaaa gtggaggcct aaagaaaaca tgtgctagct ctttagcccc tgaatagnta 2760 

agtcacatgt cagcctgctc tccttcatct gtttgggagg aggcagatta gagtcacact 2820 

gtcatcatgc tctttcccct cagaagcagc tgtaaggttt ttggtagctg tcagtgctag 2880 

caaacagtgc ttttctcaca gaactactgg aaagagtcct ggctcggaaa acttgctctt 2940 

gaaagtggca cggccagagc aggggtctct agagggtcgt gccacctcta cctgccacag 3000 

gt:t^caet^gtT"cggt:caggta agttagaggc agcagttccc cacctgccct ctggataaca 306 0~ 

gcagcctggg gctgctcctg agtcatgttt ccacttctgt cttacaggcc teat tt tec t 3120 

acccatcttt ctgtaaaaat gaaagtcagg agtcttatga aacttaccat tattcaatac 3180 

aggcttttgg tttttttctt taaattagat agggttaggt aagaagtaga gttctataga 3240 

aegttcatag gaagcaacaa aagttgatct cttggtctct acaataggag aggattgggc 3300 

tagatacctt caaagctgac ttgccctaat attctagtat gaaatgattc gaaggtacac 3360 

ctgcccctat catgtcaggc agtgagtaca gttaaaacat tgggaattgg taaaggaaag 3420 
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aaaaaaactg aaaagaaccc tttgaagtta gacaaactgt ccagagacat agtgctaaaa 3480 

tcctccytct ttttcttycc acagcttcta gaattcctct ccagagctac tctcaagtta 3540 

tatccagggg acaggcccct ttggctccaa cccacacgcc tgaactttaa ggatcattgg 3 600 

actatcttct ctgtggccar cgcagctctc ttctgtgttc acagaatggc catgataggc 3660 

atgctctttt cccacccact ggaaggctca caggcaaggt gagagaggac acagaaggtg 3720 

ccaacactgt cgctacagta- aggacctgaa gtgactttga gaaattcacc ctcacaaacc 3780, 

ttccttcagg agcaggcatt ggtagtgcag aggcacagat tccgtccttt accagctgca 3 840 

gaatcttggg caagttacat agcctctgtg agcctcatcg gtaaacagtg ggggttatga 3900 

aacccacctc acagggttgt tgtgaggatc caatgagttg atttaggtaa gcacctagca 3960 

catgccgtgg caccaagtaa gcactcaata aatcactcaa ctcctttaaa aaaaaaaaaa 402 0 

aaa' 4023 



<210> 47 
<211> 542 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (389) 

<223> n equals a,t,g, or c 
<400> 47 

agggcacgag tttttttatg actacataat/gtttattgcg atctatttta aggcttttca 
tggatctttt cagctatgea catggttagt cataatgatt gtcattttag gtcagagttt 
ctcagcctta gcattgttga cattttggtt aattctttgt tgtaggggct gtcctgtaca 
ctgtagggtg tttagcagca tccctgatct ctacctacta aatgccagga gcaacacagt 
acctccagct cagttgtgac aactactgaa tgtctccaga caactactga atgtctccag 
acattgcaga gaaattgagt ctggttgaga agtcactgtt ttagggcata atttttgggt 
agactgttag attctttgtg ttcgttgtnt ctggcctgta taactcttct taattatctg 
ctactcaaat gtatttggga tcagccactg tcttccattt ctcttttgct cacagatcta 
ctccacagct cttctccctt caaacactgt tcctcagcat cttgtttttt gcagccaaac 
at 



<210> 48 
<211> 1495 
<212> DNA 

<213> Homo sapiens 



<400> 48 

cggcacgagg ctacttatat tttatgaagg acattttttg ttagatgatc tcatcctctg 60 

tgttatttgt tgattgggtt tgttttttgc ttgttggttt gtttgtttct tccatgtaag 120 

gaaaagtagt gtaagcagta ggaagaaaat gaggaagatg tattttgcat gttcttcctt 180 

tcaatgttct tacacattgt attactgcat tgtggtaata gcttctataa aatctgccat 240 

agctgggatt atgcagcttt gcaagaatct actagatttt attctaactc atattagctt 300 

gtcctatcaa cttctggaat tatctaatta* ttgcttttaa aagtttcctg cctttcaacg 360 

-ft tec c fgc t — at gcaaaacc tzttcccagac cttggttt ct~taaaagaaag~ atgttgctac 42^ 

agttcccaat tctttcttat tacaggctca ggtgtacagg ttattctggc ttaattttat 480 

ctaatgaagc ccattccttt ttgtacatga agatgtcact taaacctatg tttacaaact 540 

aaagagacta atcactcaat atgaaaacat gaaaacattt ttgcttaaaa tattaagatg 600 

gaaatagtta aatatggatt attttgtcct tttacttttt aaaaaaagtt acatattgta 660 

tgcactgtgc tgatgeaaga attctacatt ttaatgaatt ataaaattat tctgcatctc 720 

atcacgtcac agtatttctg ctctatttat tcatatacat agaaatatat atgggcttaa 780 



60 
120 
180 
240 
300 
360 
420 
480 
540 
542 
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tcatttaaaa tttgttgcag caagaactct cctacctgta ggcaatagat tgctatgttt 840 
tcaacaaatt gtggcaaatt ctaaacagca attcttttgt acgtaatagg acatttcata 900 

ctagaaaaat aaagtaatgt ttttgacatt ggatttggtg cagtttctaa tgaagcaatg 960 

gttggttggt taatatgtct tctgtagctg ttagcattgc caaattaaaa agggtaaatt 1020 

ttatggaaat cctgagacca ggaagatatc aatttcatgt gtacttaatg gtataaagtg 1080 

ttttacagtt tctatcacca tacaaataca taaagacatt ttatagtttt atcaactata 1140 

gagctttagt ctttcaaaag taatttttga aaaacataca ttcctggcca ggtgtggtgg 1200 

gccacgcctg taaccccagc actttgggag gccgagggag gggggatcac ctgaggtcag 1260 

gaatttgaga ccaacctggc caacatggtg aaaccccatc tctactaaaa gtacaaaaat 1320 

tagccaggca tggtggcagg cacctgaaat cccagctact agggaggttg aggcaggaga 13 80 

atcacttgaa cctgggaggt ggaggttgca gtgagccgtg atcacgccat tgcactccag 1440 

cctgggggac aagagtgaga cttcatctca aaaaaaaaaa aaaaaaaaaa aaaaa 1495 



<210> 49 
<211> 818 
<212> DNA 

<213> Homo sapiens 



<400> 49 

aaaacttgag tatgttgagg gaaggaatat atatatatct gggagagaat ggatacgttt 60 

tgtttttctg aaatggaatt agaaagatgt tcagttgtct tgtgcattct tgcaaacctt 120 

gcagttttga gagccctgtt tctgccttgt a teat tt tec actgtgtatc kgattctagg 180 

agcgtgaaca gggagacaaa ggtgaagttt gtgcacacct ctgtccatgg ggtgggtcat 240 

agctttgtgc agtcmgcttt caaggct.t'tf gjncc.ttgttc cycctgaggc .tgttcctgaa 300 

cagaaagatc cggatcctga gtttccaaca gtgaaatacc cgaatcccga agaggggaaa 3 60 

ggtgtcttgg taacctaatt tttttttaaa ttat-gaaatc tgcttttata ttcaaaacta 420 

ttactgtcaa gtaaaataca tttttatgtg ttttcattgt gctgaagaaa aactaatttc 480 

agcatggaaa tatgtatgtt tggctgggtg cagcgtctca tgtctgtaat cccagcactt 540 

tgggagacca aggcaggcag atcacttgag gtcaggtgtt cgagaacagc ctggccaaca 600 

tggcaaaacc ctgtctctac taaaaataca aaaattagct gggtgtggtg gtacatgcct 660 

gtaatcccag ccacttggga ggctgaggca ctagaattgt ttgaacctga gagatggagg 72 0 

ttgcagtgag ctgagattgc accactgcac tccagcctgg gtgacagggt gacagagcga 7 80 

gactctgtct caaaaaaaaa aaaaaaaaaa aactcgag 818 



<210> 50 

<211> 1711 

<212> DNA 

<213> Homo sapiens 



<400> 50 

ggcacgagcg ctcctgtcct gccactgagg gacccggtta ccaaccctca tgtagctcag 60 

tttgcccatc tgtcccggtg ctaacacaca gttctcggga gactttcccc attcccagag 12 0 

gagtagtgcg aaatgcgtgt acctctagtc ttaagctggg cgtttgtatt agttgggttt 180 

tctggtgtct atttagcaag tgaaagtttc tggttccctc cttcactgtg tgacctgact 240 

agtcctcctg gattgcattt atggaagttt atacgagacc tagtttccat ggaggaactc 300 

act gattccg cga gggagat ggggta ctgg atgatggtct tcagcc tt aa ggctat gttt 360 

ccagtgtcct ctgggtgttt ccaagagcgg caagaaacga ataaatctct gacccttctc 420 

aggtgcagcc agagagacac tagcccactg atggacggac agacgtgggc aagggtccgt 480 

gtcactaaac cacccaccac tgccacagct gcctacaaca gacacatcag atgacactcc 540 

gggcaaataa atgattttca ctgaggactt actggtttta ataataggtc ctggtgtaga 600 

gaagtccctc aacctattgt gcaacgagtt ttgagaagcg ggtaagctgt atgttttgtg 660 

gttttgtttc ataaattcat ctacaggaag accaatattg actgaatgaa gctttcattt 720 

aaagagctaa aatatgcttt gtgtttttat atgtggatac tactttaaac ctaacgacta 780 
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ttcattgtat catagcttgt gatgtattct gctcatggct tttaaggtaa attgtgccat 840 
gatccactgc cattctaatt gctttaacaa gtcattacca cactactgtt acatcttaat 900 

tatgcataca gacaggtaga cttgttttac atatgtgaac taactagttg tcaaagcaaa 960 
tgcagattgt attctgcaag taaagtcttt ttctctctga aatttctagg- gatgttcttt * 1020 

aagtgaaatt catattaaaa ctgaagattt tagttacaag aactgagtgc agattaagtc 1080 

tttgtgattc aacatagtca agatacaact gtggatattt catggaagta tgcaataaaa 1140 

tgtctctacc tggaaaaatc tatcaagcag cgtcacagta ctgaatttga aaccagaaat 1200 

actgggtttt tatataaatg cttcatagat ttgttttatg ataaagggca cataactctc 1260 

ctaaacctca caccacctct tgaataggta taataagtcc acatcaatgc tgatgcctta 1320 

gctattatta aactcttaca gtatgatgta aagtgaaagt acaatgtaag atcattccta 13 80 

ggccaacttt gaccagtttt atacagaaac atgtgccaac ttttctgttt gcaaggataa 1440 

tatcaaagca aacaccagaa agttatatct ttgatgcatt ttttcaaaat catacacata 1500 

atacacaaac caaagacaaa tgatgaatat tacgtcagaa aatataaagt cttccccttt 1560 

cttcttttgc caagaaagtc caatattttc accattttta tgcacacaat caactttatt 1620 

taagctggaa gttaatgtct cattgttttc attgttctaa ataaacacct tttcccttga 1680 

gtattgctct aaaaaaaaaa aaaaaaaaaa a 1711 



<210> 51 

<211> 749 

<212> DNA 

<213> Homo sapiens 



<400> 51 

gccaaaccag rtaataattt ccttataata - catgaagtcg ttattt.tgca tttattttct 60 

taggtggcca atggggttat ettgggggga gacttttata ctcctaaggg acagcttggc 120 

cattaacttt caaagtttct ctaaagcagc gtcaggagat atatttggtt gtcatgacta 180 

gtggcattcc actgacatgt aatgggtaga ggctgggtag acatcctacg atgcacaaga 240 

cagcctccca caataaagaa ctgtgtggcc caaaaatatc agtgatgctg agattgagaa 300 

acttaaagaa atttaaaaat taactctata caaaatctaa tgtttgagtt ttctccatgt 360 

atctgtgact gcaatgacca gagtgactgt ccataaagaa agtgctaaga gttggctggg 420 

tgcggtggcc tacacctgta atcccagcac tttgggaggc caaggtgggt ggatcacctg 480 

aggtcaggag ttcgagacca gcctggccaa catggcaaaa ccccatctct actaaaaaat 540 

acaaaaatta gctgggtgtg gtggcacgca cctgtagtca cagctactca ggaggttgag 600 

gcaggagagg tgcttgaacc cgggagatgg aggttgcagt gagccgagat tatgccattg 660 

cactccagcc tgggtgacag agtgagacaa aaaaaaaaaa aaaaaactcg agggggggcc 720 

cggtacccaa ttcgccctat aggcagtcc 749 



<2i0>"52 " 

<211> 1091 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (1079) 

<223> n equa ls a ,t,g, or c 



<400> 52 

ggccagtggg cagggtcaca gggcaaggtc ccgcgggccg ctgggtgcgg cgacttccgt 60 

gctcccggcg agcgggcgga gagcgggggc cgcactgggg agtgtgggct gggccgcaga 120 

tgtcatgtgg cctgtktttt ggaccgtggt tcgtacctat gctccttatg tcacattccc 180 

tgttgccttc gtggtcgggg ctgtgggtta ccacctggaa tggttcatca ggggaaagga 240 

cccccagccc gtggaggagg aaaagagcat ctcagagcgc cgggaggatc gcaagctgga 300 
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tgagcttcta ggcaaggacc acacgcaggt ggtgagcctt aaggacaagc tagaatttgc 3 60 

cccgaaagct gtgctgaaca gaaaccgccc agagaagaat taatggagga cacagggccc 420 

tatggtccta ctgtgggtgg tgacttgtcc tgctaccatg ttgacagagc cccagaaccc 480 

acatctaatt ggctttgttg cttattctgg cccttcccac accacacagc cacacaaata 540 

ctggctgctc cttgatggcc aggcagaccc agcagcagcc gaggggccag tgaagaggaa 600 

ggccgcatct gttgtgtggt ggccacaagc actcaggcat ctgagtttac tggtgcactg 660 

ctgggaggag agttatgaga tgaacattgg ctgtcaatct ctgtgggcag gcggtttggc 720 

ctctagtggg aatggctggg atttgggcgt tgcctttagg agggatacct gcatgtctag 780 

ttccagtctg cactggaaag aattcaaata tgcacctggc tcccttcact attttgccct 840 

atcctttgtg ctcattctta ctgaaatctg tcttgtcagc tcaggaatgg gattccccca 900 

ggaaggaaag cacttttctg ttctgggaag cccagactgt tcactttggg gcagggacga 960 

acatgtgcct' cgtgaatttg cttgaaaaca gtcaccatct tctaccccca tcactgtata 1020 

gtgaaaaacc tgattaaagt ggtatctgag aaccawaaaa aaaaaaaaaa aaaactcgng 1080 

ggggggCCCg g 1091 



<210> 53 
<211> 2254 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (1182) 

<22 3> n equals a,t,g, or c 



<400> 53 

ggcacgaggc ccgctgcaat gttatcatca cccaacctcg ccgcatctct gctgtgtctg 60 

tggcacagcg ggtcagccac gaactgggcc cctccctgcg ccggaatgtg ggcttccagg 12 0 

tgcggttgga aagtaagccc ccatcccgag gcggggccct gctcttctgc actgtgggta 180 

tcctgctgcg taastgcaga gcaaccccag cctggagggc gtgagccacg tcatcgtgga 2 40 

tgaggtgcat gagcgggacg tgaacacaga ctttctgctg atcctgctca agggcctgca 300 

gcggytcaac ccggccctgc ggctggtgct catgaktgcc acaggggaca atgagcgctt 3 60 

ctcccgatac tttggtggct gccccgtcat caaggtgcct ggcttcatgt acccagtcaa 420 

ggagcactac ctagaggaca tcctggccaa gttgggcaag caccagtacc tgcaccggca 480 

ccggcaccat gagtctgagg atgaatgcgc actcgatttg gaccttgtga ctgatctggt 540 

tctgcacatc gatgctcgcg gggaaccagg tgggatcctg tgcttcctgc ctgggtggca 600 

gagatcaaag gagtgcagca gcgcctccag gaggccctgg gcatgcacga gagcaagtac 660 

ctcatcctgc cagtgcactc caacatcccc atgatggatc agaaggccat attccagcag 720 
cctccagttg gggtgcgcaa^ gattgtcttg" gccaccaaca ttgctgagac " ttccatcaca ~" 780" 

atcaatgaca tcgtgcatgt ggtggacagt gggctgcaca aggaagaacg ctatgacctg 840 

aagaccaagg tgtcctgcct ggagacagtg tgggtatcaa gagccaatgt gatccagcgc 900 

cggggccggg cgggccgctg ccagtccggc tttgcctacc acttgttccc tcgaagccgg 960 

ctggagaaaa tggtcccttt ccaagtgcca gagatcctgc gcacacctct tgagaacctig 102 0 

gtgctgcaag cgaaaatcca catgcctgag aagacggcgg tggagttcct gtccaaggct 1080 

gtggacagtc caaacatcaa ggcagtggac gaggctgtga tcttgctcca ggagatcggg 1140 

gtgctggacc agcgggagta cctgactacc ctggggcagc gnctggctca catctccacc 1200 

q accc ccqgt tg_gccaag_gc_cattg_t g_ttg^ g ctgccatc t tccgttgcct g caccca cta 1260 

ctggtggtcg tttcctgcct cacccgggac cccttcagca gcagcctaca gaaccgggca 132(T 

gaggtggaca aggtgaaagc actgttgagc catgacagcg gcagtgacca cctggccttt 1380 

gtgcgggctg tcgccggctg ggaggaggtg ctgcgttggc aggaccgcag ctcccgggag 1440 

aattacctgg aggaaaacct gctgtacgca cccagcctgc gcttcatcca cggactcatc 1500 

aagcagttct cagagaacat ttatgaggcc ttcctggtgg ggaagccctc ggactgcacc 1560 

ctggcctccg cccagtgcaa cgagtacagt gaggaggagg agctggtgaa gggcgtgctg 1620 

atggccggcc tctaccccaa cctcatccag gtgaggcagg gcaaggtcac ccggcagggg 1680 
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aagttcaagc ccaacagcgt cacatatagg accaaatcag gcaacatcct gctgcacaag 1740 

tcgaccatta acagggaggc cacacggtta cggagccgat ggctgacgta tttcatggca 1800 

gtcaagtcca atggcagcgt cttcgtccgg gactcctctc aggtgcaccc gctagctgtg 1860 

ctgctgctga ccgacgggga cgtgcacatc cgtgatgacg ggcgccgggc caccatctca 192 0 

ctgagcgaca gtgacctgct gcggctggag ggtgactcgc gtaccgtgcg gctgctgaag 1980 

gagctgcggc gggccctggg ccgcatggtg gagcggagcc tgcgcagcga gctggctgca 2040 

cttcccccca gcgtacagga ggagcacggg cagctgcttg cgctactggc agagctgctg 2100 

cgaggaccct gtggcagctt tgatgtgcgc aagacagctg acgactgagc cctgcttctg 2160 

ctggggctgt gtacagagtg caaatgttta tttaaaataa agttctattt atcccttgtg 2220 

aaaaaaaaaa aaaaaaaaaa aaaaaaaact cgag 2254 



<210> 54 

<211> 486 

<212> DNA 

<213> Homo sapiens 



<400> 54 

cacactgaca tctccccaac aggtgagggc agggagagct ccagacaggg agaggccttc 60 

agagaacagg aaggaagctc cctccctcct ctgcattttg cagcctgtag ctcacgtgcc 120 

ttttatgccc cacatctcat tctgtctggg gactccatac gtagtggctg tctaccttcc 180 

cgcgtggatt gtaatgcttt tgctaccagg ggtcaggcca tactcatcac tgcaggccct 240 

gaagcat'cca tcatgttcct cgagctcagt atgtgctccg tacatgtagc acagtggaaa 300 

aacttgagct ttgctggcaa agacagacag aatgagcttg aatctcagcc cagctatggc 360 

ttttctagtc ctgtggctag aaaatgac.tt -agcctcttgg actttggt.ta acccatctgc 420 

aaaacaggga tggcacccac ctctta'gaaa gttacagtgg tcaaaaaaaa aaaaaaaaaa 480 

ctcgag 486 



<210> 55 

<211> 1270 

<212> DNA 

<213> Homo sapiens 



<400> 55 

gaaaccatcc aagataagag acatgggagt gaaattcaca cccactctgg ctttcatacc 60 

atgggtctga acattagccc atggtgtttc ttggccatac tgacctgtgc catttcagct 120 

gcattcatct cagttggtgt tgtctgctgg ctgctctttc tgatttccca caggagcagt 180 

aagaacctga ggaagagtag ggtcagagga gtctgggaga atgaggaaat atgagagccc _ 240 

caggaactga aaaggcctgt gagagactct gagcttcctg ggaacaggta taggttcttt ~ 300 

ttatttcaat aataacagaa acaactgtca aaaccatgtg cctgtactat ttggagtgct 3 60 

gtccttgcag aatctcatta taagaacctt aggaaatagg cacatcatct cctggataga 420 

atcctaggaa atgggcacta taatgggcac tttatcccat tttataaaca tggaaattga 480 

ggcacagaga gattaagtac tttcccaagg tcatacagct agtgatggag gagctagcat 540 

ttgaacccsg gagtttttag tctattgagt ttaaccgaca gatcatactg tgttttggta 600 

gggaggragg gtgaagcaag caartgaaca aatgartctg ggatttarga cttgccagac 66.0 

aaacaaggcc caagaggcaa gtgtgcaggt gggtgtagtt gggagtcagc agagttgggt 720 

- tggaattcaa- getttgeeae -Gtgc tggcta"~taaaccttgg~ttgggtaagt-aacccaaggt_^ 7JJ_0_ 

aaatgagatc atctctgtaa aactcttagc cttgtgcctg gcacatagta aatgcttaat 840 

aagggttcac tgttagtatt actgttactg ataacataca aatagattgt attaatggac 900 

cataattgca actgtataaa acaaattcca tgtttggcca ggcgcagtgg ctcaagcctg 9 60 

taatcccatc acttcgggag gccgaggtgg gcagatcacg aggtcaggag atcaagacca 1020 

tcctggctaa cacagtgaaa ccccatctct actaaaaata caaaaaaatt agccaggcat 1080 

ggtggcgggt gcctgtattc ccagctactt gggaggctga ggcaggagaa tggcgcgaac 1140 

ccagggggcg gagcttgcag tgagctgtaa ttgtgccact gcactccagc ctgggcgaca 1200 
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gagcgagact ccgtcycaaa aaaaaaaaaa aaaaaactcg agggggggcc cgaacccaat 1260 
cgccctatag 1270 



<210> 56 
<211> 2059 
<212> DNA 

<213> Homo sapiens 



<400> 56 

ggcacgagcc 

tgcacagaga 

aaaaagtttt 

aagcctttgg 

agtggtaaaa 

ggtctagggg 

aggagcactg 

ctcttagggc 

ccccgcctgt 

ctgtcaaagc 

agacttttct 

gcaggaaatt 

aagatatgaa 

ccatgcagat 

tttgggagga 

tccacatctg 

tggcgtcaag 

ttaataggaa 

caagttggct 

gcagagcacc 

gcgggtgccc 

gaatcatgag 

tcgggactag 

cctaatccga 

cagtgctcag 

cctgcgtctg 

tttgaggctg 

ataggaatgt 

ctctgcagcg 

tatagtgtct 

gcctgtgatc 

gaggttgcag 

tttgtctcaa 

gcccagaagt 

taaaaaaaaa 



tcactgggta 
agcactaagt 
agggaacatt 
gcacttttca 
atacctcttt 
cctccaggtg 

ggagggacag 

cctgccactt 
gcggctggat 
ggaaaacagt 
ccttcatgac 
ttgtttcctg 
ccgagagccg 
aacacagcag 
accgctgggc 
ggctgagttc 
agaagtaaag 
agattttttt 
ctgctgaagt 
ttcctgctga 
aagagggctg 
aggttctcag 
ttgtgtttag 
acttcagaaa 
atatggtcag 
acagaagctc 
caccgtttct 
gctcgggcag 
tctttcgggt 

tggggtcctg 
ccagctactc 
tgagcggaga 
aaaaaaagga 
tctgatggag 
aaaaaaaaa 



aacacaagct 

caaagtcatc 

aactgttaat 

aacctaacaa 

gttatctaag 

tacggctttg 

tggaggaaga 

actttgctca 

ctgggctgcg 

tcacacacca 

cctcatctct 

cttgtcttca 

tggcagctgt 

ggtccaggcc 

aggttctg'cc 

ccaaagaaag 

tgttcagatg 

ttttatccgt 

agaaatggtg 

ccacagctgt 

agggctgcgt 

ggctgcctcc 

gttttcttaa 

tccaaaatgc 

accctggagc 

cagagaagtg 

ggaagtcaag 

ggaacccgga 

cctgtgggtc 

caggcttggc - 

aggaggctga 

ttgcgccatt 

acgtgcctca 

cttctgtcag 



ctcggcaggg 

tccgtgtggt 

ggcttatgta 

acaactgatt 

acaactttta 

aagtaagggc 

ggccccgccc 

gagcaaaagg 

tcagcaccgg 

gagcctcatg 

gctctcgggc 

aagacttcag 

ccagcttgca 

cttccctctg 

caggaacagt 

tgcgcctaac 

gcagtgattt 

ttggttttta 

ggcggggagt 

gagcgccggc 

ctgccatggt 

cacaggcttt 

aattctgtag 

tccagtgagt 

actttgcatt 

gctgcgagtt 

ccccacagtg 

gcaccagccg 

tggctgtgcc 

tgtacccggc 

ggcaggagaa 

tcactccagc 

ttcagtggtt 

acacaggctg 



aacaagctct 

cattaaaatt 

ttagctgttc 

gaatttctat 

ggtggcatta 

aagaagcatg 

gggtgccact 

tgccgtcagc 

gacccgcccc 

tggaaagagt 

ctcagcccgc 

ggcttagaac 

ggctgatatt 

gaactcacac 

taactctgca 

aacgtcttgt 

taatgaacct 

tttttagaat 

cgccactgac 

tgtacgcagg 

gtctccacct 

ctgtgtctta 

taattgcatt 

atttcatttg 

ctgaagtgaa 

cagggcaaga 

gacctcgagt 

ctgggcccct 

cagccctgct 

cccagtgcat 

tcacttgaac 

ctgggcagca 

cgatggtggt 

agtatccttc 



cggcagggaa 

ccagtgaatg 

tctgttttaa 

tgatggtcaa 

agaccccgag 

ggtgccctct 

gctgaggcag 

tgggtcctac 

agcagcacat 

ccaggcgctc 

ctgtcgtact 

ggaccataga 

tgtgtagatg 

tcggctgtat 

gagcacagtt 

ggaaggcggt 

actagctatt 

catgaaatag 

catcgtctgc 

tccctgctgt 

tggacaccat 

cctgggacac 

gtagagcatc 

agcatcatgt 

ggatgctcag 

ggctcctggc 

ccctctgtga 

cgttccccgg 

gcccgtggac 

ggtggcaag'tr " 

ctgggaggca 

agagcgagac 

tctgatcaat 

accccaaaat 



60 
120 
180 
240- 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2059 



<210> 57 



<211> 868 
<212> DNA 

<213> Homo sapiens 



<400> 57 

gactgactat agggaaagct ggtacgcctg caggtaccgg tccggaattc cgggtcgacc 
cacgcgtccg ctgaatttag gagacttttt acccaggggc aaaaggctct tagggtaatg 



60 
120 
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agatggatgg tggcccaggt gcattttcca gggcctgggt tctccagatc ccgtggcttc 180 

tgttgagtgg aggcaacttt gctctgtgtg aacctcgccc ctgtccctct gccgggcacc 240 

cctggcagga agcaggactc ccatcctcac cctgacttag actgtcctct gagtcagctc 300 

ctctccaaga caggagtggg cagccctggg cagtcttctg gccccttgct aaagtgaggg 360 

scaggaagct ggggctgccc tccagaaagc cggggtaggr actctgaaaa atacctcctc 420 

taaacggaag cagggytctc cagttccact tggcgccccc tcccacaagg cccttcctcc 480 

ctgaggaccc caccccccta ccccttcccc agcagccttt ggaccctcac ctctctccgg 540 

tgtccgtggg tcctcagccc agggtgagct gcagtcaggc gggatgggac gggcaggcca 600 

gaggtcagcc agctcctagc agagaagagc cagccagacc ccaaccctgt ctcttgtcca 660 

tgccctttgt gatttcagtc ttggtagact tgtatttgga gttttgtgct tcaaagtttt 720 

tgtttttgtt tgtttggttt ttgttttgag ggggtggggg gggatacaga gcagctgatc 780 

aatttgtatt tatttatttt aacattttac taaataaagc caaataaagc ctcaaaaaaa 840 

aaaaaaaaaa aaaaaaaagg gcggccgc 868 



<210> 58 

<211> 986 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (592) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (669) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (767) 

<223> n equals a,t,g, or c 



<400> 58 

gaaattaagt catttagata aaaatatgcc attcttatct gtttggtttt ttaatcttgg 60 

cttaatattt ggggttgagt catttgtttt gagagctgtc ctgtttattg cagggtgttc 12 0 

agcaacatcc cagatggaag cagcatcccc ctacccagct gtgacaaaaa gaaaaaaaaa 180 

tgtctccaka cactgccaaa tatcttctgg gggtgcccct ^ggttgagaac "cac tgctt ta " 240 " 

gtggataaac tttaggcagg agggaaatga tcgcagttgg atagttggag gaatgtggag 300 

caagggaagc aataaactgt gaccataaaa acatagaaag atggcttata tgtggatttt 360 

tttttaaagc acgtagaatt gcttaaaatg gacaacagca gcatataaat cagtggcaga 42 0 

gttggtggct gaatttagag catcttaagt ctatgttctc ctggaacaga gtgcagataa 480 

ttcagttatc agcttggcta ggtgcatgtt gaagtattta gtcacacaca aacagttaat 540 

gtatggggaa gataacttct atactagtag gagagaaatg gaacaagaat wntaaataca 600 

ctatcaaaat atgcaagaat ggcaagmgga aaaggcagaa caagctgcaa aacmcacaca 660 

caat,tagana_t^aaaj:a_tt^^ c tc tgt taat 720 

gacaaagttc tccagttaaa ggaaggcaaa tagtgttatt aggaatngat tactatatga 780 

tgattaaagg ctcagttcaa caggaagatg attgatagaa ctttcctaca tttgtaacac 840 

agtcttagaa gatattaaag caaacattca agaagaaatt gatcacctac taccatagtg 900 

tattttattg aattggtaca tttcaataaa gtgtcataag gcacggttga aggaaaaaaa 960 

aaaaaaaaaa aaaaaagggc ggccgc 986 



WO 99/47540 



PCI7US99/05804 



34 



<210> 59 

<211> 695 

<212> DNA 

<213> Homo sapiens 



<400> 59 

ttttttttct tgaaataaaa tgggggagta atgggaaata atttttttga gcccttgcgt 60 

ttctaaaaat ' gtttgcattg tgccttcatg tttgacagtt cagttccagg ttgaaaatta 120 

tttttctttg gaatgttaac agctgccctc tattttctgt ttttatctaa tgttgctgaa 180 

gagaaatctt ctgattctta ttcttttttt ggtgacctgt tttaattttg tgtctttctt 240 

ttttttcccc tggaagcttt taggctctcc cttttatcct tgtagtctga gatctgacaa 300 

tgatggttgt gtctagtttt ccccaggata ctttttcatt tgtcctggtc agcactagta 360 

gatcctttca gtttgtgtac ttctgtttct cttgctctgg agaatttaaa aaatatatat 420 

atttttgaga cagagtctca ctctgtcacc caggttagag ggcagtggtg tagtctctac 480 

tcactgcaac ttctgcctcc tgtgtttaag cgattctcct gtctcagcca cctgagtagc 540 

tgggattaca ggtgcctgcc accacgccca gctaattttt ttgtattttt agtagaggca 600 

gggtttcacc acgttgccca ggctggtgtc gaactcctga cctcagatgg tccacctgcc 660 

ttggtctccc aaaaaaaaaa aaaaagggcg gccgc 695 



<210> 60 

<211> 314 

<212> DNA 

<213> Homo sapiens 



<400> 60 - • 

gtcgacccac gcgtccgctt tgaggagcat tcctctagat tgcacaaggg acagtgcctt 60 

taaccaagcg aggagtccaa agctcaggac ctgactaccc tgagggcacg ctgacgcctc 12 0 

ttcccagggg gatggggagc tttctgcacc cccagtggca tctcctcatc acgttctgtg 180 

ccgtccttgg gaaaggcctg cattctgatc cttccaggcc cttcgagcat ggaggggcac 240 

tggggaaggt cccccgaggg aggagcacgt tgctgagtaa agaggtgtta ctcaaaaaaa 300 

aaaaaaaaaa aagg 3.14 



<210> 61 

<211> 734 

<212> DNA 

<213> Homo sapiens 



"<400> 61 " " " - " — - - 

gactgcttat atttggcatt gtcttttccc tggcactgcc actgtcacca ccatccccct 60 

tctggatccc tactttaccc cttcatgctg ctctggtggc agtgcctctg ctgccatgct 120 

gtacttgagc ctgctgctac agccatgcct gaagatgcag ccccttcctc tcttcctgtc 180 

ccaccaaata tgaccagctc taggttccat tacttctgga ctttgctcca aataaaactt 240 

acacaatttt attccaaacc caggtctctt tctgcaacac ccgagaaaaa tattgggctg 300 

caggagccag agaggagaga gagatttact ggtgagagtt gtaggtggga attgaaagcc 360 

aagtcatgtc tttgccccac cagaaactca ctaggatgta cacaatgcca ctgtgatggt ^ 420 

k 1 1 aaaa Jta.t_g ta^c^aa^^gcacg^g^gc^^a t.gtac_ c ctaaaactt caagtatata 480 

taaararaga aagaactgst gatacacata tcatgaaaaa agaccaaata aaataaaaaa 540 

ataaaaataa ataaataaaa taaaatatgt ccacaaatgc tttgatgttc ctttgtttct 600 

tgatctgtat gctagcaaca caggttcatt ccgtttgtga aaattcattg agctgtgctc 660 

ttatgagctg tgtacttctc tacatgtatg ttaaatgtgg acaagaactt cacataaaaa 720 

tcattttaaa aaaa 734 



BNSDOCID: <WO 9947540A1 I > 



WO 99/47540 



PCT7US99/05804 



35 



<210> 62 
<211> 1410 
<212> DNA 

<213> Homo sapiens 



<400> 62 

ccgcctcctt gccgcccagc cggtccaggc ctctggcgaa catggcgctt gtcccctgcc 60 

aggtgctgcg gatggcaatc ctgctgtctt actgctctat cctgtgtaac tacaaggcca 12 0 

tcgaaatgcc ctcacaccag acctacggag ggagctggaa attcctgacg ttcattgatc 180 

tggttatcca ggctgtcttt tttggcatct gtgtgctgac tgatctttcc agtcttctga 240 

ctcgaggaag tgggaaccag gagcaagaga ggcagctcaa gaagctcatc tctctccggg 300 

actggatgtt agctgtgttg gcctttcctg ttggggtttt tgttgtagca gtgttctgga 360 

tcatttatgc ctatgacaga gagatgatat acccgaagct gctggataat tttatcccag 42 0 

ggtggctgaa tcacggaatg cacacgacgg ttctgccctt tatattaatc gagatgagga 480 

catcgcacca tcagtatccc agcaggagca gcggacttac cgccatatgt accttctctg 540 

ttggctatat attatgggtg tgctgggtgc atcatgtaac tggcatgtgg gtgtaccctt 600 

tcctggaaca cattggccca ggagccagaa tcatcttctt tgggtctaca accatcttaa 660 

tgaacttcct gtacctgctg ggagaagttc tgaacaacta tatctgggat acacagaaaa 72 0 

gtatggaaga agagaaagaa aagcctaaat tggaatgaga tccaagtcta aacgcaagag 780 

ctagattgag ccgccattga agactccttc ccctcgggca ttggcagtgg gggagaaaag 840 

gcttcaaagg aacttggtgg catcagcacc cccctccccc aatgaggaca ccttttatat 900 

ataaatatgt ataaacatag aatacagttg tttccaaaag aactcaccct cactgtgtgt 960 

taaagaattc ttcccaaagt cattactgat aataacattt tttccttttc tagttttaaa 1020 

accagaattg gaccttggat ttttattttg gcaattgtaa ctccatctaa tcaagaaaga 1080 

ataaaagttt attgcacttc tttttgagaa. mtatg.t taaa gtcaaagggg catatataga 1140 

gtaaggcttt tgtgtattta atcctaaagg tggctgtaat catgaaccta ggccaccatg 1200 

gggacctgag agggaagggg acagatgttt ctcattgcat aatgtcacag ttgcctcaaa 1260 

tgagcaccat ttgtaataat gatgtcaatt tcatgaaaag cctgagtgta ttgcatctct 1320 

tgatttaatc atgtgaaact tttcctagat gcaaatgctg actaataaag acaaagccac 13 80 

cctgaaaaaa aaaaaaaaaa gggcggccgc 1410 



<210> 63 
<211> 1231 
<212> DNA 

<213> Homo sapiens 



<400> 63 

ggcacgagtg aatgtcgagg agttccagga tctctggcct cagttgtcct tggttatfcga 60 

tgggggacaa attggggatg gccagagccc cgagtgtcgc cttggctcaa~ctgtggttga 120~- 

tttgtctgtg cccggaaagt ttggcatcat tcgtccaggc tgtgccctgg aaagtactac 180 

agccatcctc caacagaagt acggactgct cccctcacat gcgtcctacc tgtgaaactc 240 

tgggaagcag gaaggcccaa gacctggtgc tggatactat gtgtctgtcc actgacgact 300 

gtcaaggcct catttgcaga ggccaccgga gctagggcac tagcctgact tttaaggcag 3 60 

tgtgtctttc tgagcactgt agaccaagcc cttggagctg ctggtttagc cttgcacctg 420 

gggaaaggat gtatttattt gtattttcat atatcagcca aaagctgaat ggaaaagtta 480 

agaacattcc taggtggcct tattctaata agtttcttct gtctgttttg tttttcaatt 540 

_ga^^_gta^^^asia.t3ia.ca .g attagaatct* agtgagagcc tcctctctgg tgggtggtgg 600 

catttaaggt caaaccagcc agaagtgctg gtgctgttta aaaagtctca ggtggctgcg 660 

tgtggtggct catgcctgta atcccaacat tctgggaggc ccaggcggga gaactgcttg 720 

agccccagga gttcagaatc agcctgggca acatagcaat actccgtctc ataaaaatta 780 

ataaataaaa agtctcaggt gaccaaaggc tcctgaagct agaaccaggt ttggataaag 840 

attgaagagc cacaggccac tcttccctct gagccattgg gcctagtggt gtcatgtatt 900 

gtaattgctc gcagggagag cagtcttttt ggtgtaatag tgggatgtct gcttagttgg 960 

caggggttca gtccaaatgg aagaatattg ggaaataaac ctccactatc ctttatagcc 1020 
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agggactttt ttcctattta ttcataaaat aaattatagt taattatacc cataacacct 1080 

ttatttaaat ccagtgttct ccgcagcctt ttgtctattt atatgtgtac caagtgttaa 1140 

acataattat tattg'ggcat ttgaactttg tttttcttta aagaaatgct gctattaaac 1200 

atatttgtaa atggaaaaaa aaaaaaaaaa a 1231 



<210> 64 

<211> 612 

<212> DNA 

<213> Homo sapiens 



<400> 64 

ggtcgaccca cgcgtccgag catttgtctg tataatttta gttattgaat taaaatcttt 60 

tgggacccca acaggatgag atcattggcc agctggcttc ctcccacctg cacctggact 120 

gaaattcccc gtggcattag aggtgtttcg taaggtgctc cctgctgtct gtcctacaga 180 

ttgcagtggc tctgctggaa aagaacggaa ttctatgcaa gttgcgtgtg tcatgaaggt 240 

ctctgcacag tgggtgtgtt tctttgtcgt cttttctcca ctctgctctt ctgtgaaatg 300 

tgccagcagt ggacagaaca ggggcagagg tgatcagtga ccattgcaca gaatatcagt 3 60 

aagtgttgta aggtatatag tcttggccaa caaattgtaa gcaaaatacc aggaacttcc 420 

taatctagta ggaaattttg tatgcttttg acaaacatct gatcctactg acactgaaag 480 

tccttagaag gagaattgct tgaacccgga aggtggcggt tgcagtgagc caagatggcg 540 

ctactgcact ccagcctggg caataggaat gaaactccgt caccaaaaaa aaaaaaaaaa 600 

aagggcggcc gc 612 



<210> 65 ....... 

<211> 2270 

<212> DNA 

<213> Homo sapiens 

<400> 65 

tttttttttt aactttttaa acaatccatt ttaatcatct aaattattta caatacaata 60 

acatggattc atccttttta agacatggga ttgtaaaaat caacaagtga atgatgcttc 120 

aaataataca tttaaataca ttaatcaaat tttttcagtg cttaaaactt tttctccatg 180 

ggacagcagg ctctggacaa aagtgcctag catacaagtt ttcccaattt ccttctatca 240 

taccagctgc acataaaaag gttcatcacc tcctgtctcc aaagtgtctc cctactgagt 300 

gttcccaggc agacaatagt tcctgggata gtgctgtttg gtaacagaaa agcccaagcg 3 60 

tagaggacgg attaaaaggc agggaccaga ccgccatgga tacaaatccc aagacagagg 42 0 

atgccccatg ccttccccat gaagcttatc tgtctgcctg tgtctccatg attgcaggca 480 

tagagctac t tgggacctcc "aggatgattt acttagcgat atgcttttt^cattctaaga ~540 

atcaaaatgg tcctgtaatt cccaatagag aaaatagagc caattcattg ttctcccctc 600 

tcccctctga agccagtttt taaagatgag ccttacccag aaaataagcc ccaaagaact 660 

ctcatctaaa tgatcagacc cttcctaaat tacctttggc aacctaggta attctttttt 720 

attacacacc tccaacctga ccctttctac agtttcaact ataaatgttc atgcccctcr 780 

tcaaataacg ttgctaggat gaatttgcca caggtttgag tacagagaga acaagcaaga 840 

aaaatgtcag tgtttatttt aaggagagtg gccaggatgt cagtcctcat aattggtccc 900 

ttctctctct ctatcctcca aggtaagttc tttgttgact tgataagctt tagtccttct 960 

gtacaactt c_ tagaa gatgc acttaatggt gcttctt t gc a c ttccagaa c tc accttct 1020 
attctacctg taaggctgta ggggagcatc ccaatcaaca taaggcctac ccctttagcc 
acgaaaatca gccaggcatc atgtttctgc accaccacct gccttcctga cggacactgg 
tgctgatgac aaaaatggga cagtaccgca gctggtttct ctttttcgag tgtgtagata 
agaaataaaa aacattttca ttccctcaca agcttaatct agtaatataa ctgcctaaaa 
aaaatcaaac cataaataaa cctatgtgct aaacaaatca catgacttga tgacttctct 
aaaattaatg tcaaggaaaa aaggaaaagt tgatcccaag taaaatccct tgaccacagc 
tgtctgaaat tagccagggg aatgggagac accaccaaga acctcagctc tttcctgccc 



1080 
1140 
1200 
1260 
1320 
1380 
1440 
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tgtatttcaa ggggagtgtt gtggccttca caaatgaaaa ttatgaatca caaagataaa 1500 

cgtcctcact tctaacctgg tgaatcctca ggaatgtcat gaggatgaca acacagggtt 1560 

aattcatttt ttctcagtct cccccctgac tccacaaaag ctttgccttc ccaacacaag 1620 

gggctgggag gtccagtcta gacagagcat gctgttgggg taaacagtaa ccatgtgatc 1680 

ccatgattcc cagagctctg agcacaaagc ttttcatccc agtggcaact ggaatgtggg 1740 

taattctgta aactcatggc cacaccttta atgcttgggg acagtgggtg gagtcagcca 1800 

gagctctttt ccaacttcat ctagggtctt ctctctggaa aagcttagtg acgttctccg 1860 

aaggtttatt tggttaagga gtattgctaa aacacttttt aaaaatccac tttgaacaca 1920 

tgtgtaagct gaaaagaaaa tgacatatat acctccattg aagctgggaa agtgaaaagg 1980 

ctgacgaaat gtctgaaatc ctgagccttt cctggttcta ttttaataca gcgtacaggt 2040 

aacagatgat ctcatttacc ttctgaatga cccagcactc aatttcccta aaactgctca 2100 

gctccacttg gaaatcacca ggggacttga gaatcttccc cttagactca gggagacacc 2160 

cagaccagga agaagggcac tgatgttttc agggacccaa aagcccactt tttttttttt 222 0 

tttttttttt ggaattcgat atcaagctta tcgataccgt cgacctcgag 2270 



<210> 66 

<211> 1283 

<212> DNA 

<213> Homo sapiens 



<400> 66 - 

ggcacgagcg agggaacaga ttcccagggt ggggtggggt agggctgggc ctttgctcct 60 

ttgtctcctt tcccaaggca aagtgaagag aagagagtga ctccttcttc actcagggaa 12 0 

cccaggcagg gatgaagcgc ctgtggtgtc . tgagctgggt cccggggctg caggggagcc 180 

cctcagtgtt gtcctctgta ttcttctccg tgttcaaacc acagctgcat tggacatgca 240 

gtcaggtgtc ctctcactgg caccctccct gccttttcat tcttttttct ggatagtctt 300 

tcatcaagtt ttctctgcct tcaccttgct cttcctgaat cagttcacct tgaggggggt 360 

taacagagca ccttggcagg ctctgttcct ccaggtccca ggccagcccc cgggactcag 420 

ggcctgcctt cccctcacct tcttgagcag cacaaactcg ttctctgctg ctgtccgctt 480 

gttgatttcc tcctcatacc tgtgggaatg gcgagggctc attggttaat atctcactga 540 

aagcccttgc tttcacaggg cagcgttgag caaggagcag cgtgtccatc agaagataca 600 

ggtgctgggg gccgcagagt actgggcagg ggtaagtggg ggaaggcttc ctggaggagg 660 

gaacatgcta accagtttgg aagaatgaga ctgttaaaga tccagcttgg caaacgagga 72 0 

aggagcacat ggagcgaatg ccctgtggga ctctcagagg aaccaggatg tgaactgccc 780 

tccccaaatt tgagtacagc tttacaaatg acaaagcgct ccaatctgca tttcctcagt 840 

tacccttgag agcagtcttg gagccagatg cacttaaccc tcctcttaca tgggagagaa 900 

cgtgagtggt ttccccaaac attccctaaa cccagagcca gagataaccc tctgcccact 960 

gcccagctca ctgggcattt gtcctaagag tcaggccaga ggctggagga gcagagagca 102 0 

-agttccagag ttttgttggg~gtgactctgc-ttgatatgac~-caagaacaat-gccctccact 1080" 

gacctccaaa gcatttaagc tggggtgact gccaggggtc ccttggaggg acaagggcag 1140 

ttgtccagtt acagggggac tcctcctgct cacctcttct tgtagtcctc cactacgtcc 1200 

cgcacattcc tcagctccga gtccagcctc accctgtccc cagacagcgt ctccagctgc 1260 

ttccgcaggt tgctgatgta gcc 1283 



<210> 67 
<211> 1263 
<2T2>^DNA : 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (1256) 

<223> n equals a,t,g, or c 
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<400> 67 

gaggagatcg ccacctccat cgaacccatc cgcgacttcc tggccatcgt tttcttcgcc 60 

tccatagggc tccacgtgtt ccccacgttt gtggcgtacg agctcacggt gctggtgttc 120 

ctcaccttgt cagtggtggt gatgaagttt ctcctggcgg cgctggtcct gtctctcatt 180 

ctgccgagga gcagccagta catcaagtgg atcgtctctg cggggcttgc ccaggtcagc 240 

gagttttcct ttgtcctggg gagccgggcg cgaagagcgg gcgtcatctc tcgggaggtg 300 

tacctcctta tactgagtgt gaccacgctc agcctcttgc tcgccccggt gctgtggaga 3 60 

gctgcaatca cgaggtgtgt gcccagaccg gagagacggt ccagcctctg atggctcgga .42 0 

gatgatggac cgtggaaggg aagcgtctgt ggggagtgag cgcttagatg gccagcagct 480 

gctccttctg ggaagctcgc accttggcaa cagaacagcc ctctagcaga gcgtcagtgc 540 

agtcgtgtta tcccggcttt tacagaatat tcttgtccta ttttagaatt ttccggagta 600 

gtttatttgc agtctgttga ttatgtgcag tagacccggg acactgcgtt ttaccgatca 660 

ccttgaatgt ggtgcctgga tgtgcctttt ttttttttcc ctgaaattat tattaatttt 720 

ctattgtgag ttcatcagtt catagttttt ttagtaaaga agcaaaatta aaaggctttt 780 

aaaaatgtac aacttcagaa ttataatctg ttagtcaaat atttgttatt aaacatttct 840 

gtaatatgaa gttgtaatcc tggccgtgag cttggaagct tacttttgat tcttaaagcc 900 

tatgttttct aaaatgagac aaatacggat gtctatttgc cttttattgt aacttttaaa 960 

tgaaataatt tcatgtcaat ttctattaga tatatcactt aaaatatttg gttttaaatc 1020 

acaagaatat gtattcttta ataaagataa tttatgatca tggtataatt aattgaaatt 1080 

tattaaaatc tgtttttatt aaaaaaaaaa aaaaaaaaac tcgagggggg gcccggtacc 114 0 

caattcgccc tatagtgagt cgtattacaa ttcactggcc gtcgttttac aacgtcgtga 1200 

ctgggaaaac cctggcgtta cccaacttaa tcgcctttgc agcacatccc ctttgncagc 1260 

ttg 1263 



<210> 68 

<211> 1617 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> SITE 
<222> (1578) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1586) 

<22 3> n equals a, t , g, or c .— - _ . _ 

<220> 

<221> SITE 
<222> (1605) 

<223> n equals a,t,g, or c 
<400> 68 

_tcqacc.cacq_cq_tcccrqgaa acctaatactj gcaacct.gca _at g t aggat g_t tt.gjtat ggc 6 0_ 

atttaaaggt aatggtgatg tttattattc tatactttgc atactgtgag agtaattttc 120 
actctgtctt aagtgtgagt aagcctcttc taaaaatctt gttcttgcca agaaatttat 180 
aaatcacata cgaagacgtc tgttgctaac agttaacttt atgaggtaac tatatccttc 240 
tatttctctg gactcatttt taaaaaatat gccgaatact gcatactgtt taaggtagta 300 
tataagttta tgagagaagt ggagagcttt cttccttgaa aagtcggtat ttgttgagat 360 
accatttgcc tcacagagag gtgttcccca ctcccatccc cattgccaga taaataaata 420 
ttttgagaaa agtgacctaa aacagctgga aatcttaggt gcatctgtct gcagacctcc 480 
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ttaagcaggc tgtatcttac aattccctta ctgcactggg taagtgttaa cttagttttt 540 

gttttgccct ttgctttaaa tattctccaa attaccattt atgcaacatg gttagggtta 600 

atactgcatg gtattcattt acttgtttca tgaactttcc agtactgtac aaggtcaaca 660 

aagtaatgcc tgtggtatcc tcatctctca cttttttact ctgtggtttt agcacagtaa 720 

ggtactgcaa agaccttcct tccaaatgtc tccttgactt tattccttgg gccaattcag 780 

tatcctcaac atcctaagat tttgttgttt tatcactgac ctgtggttgg cctgttttat 840 

tctaatttcc agaaaagttc aatcccagta tttgcaatat caaataactc taaaaccgat 900 

gttgtgattc taccttcctt actattttta ctgggcaaat gccctatttt tttaattatt 960 

attattttta acttttggga cacacaaaaa tcagcaattc tcatgaagcg tttgttagtg 1020 

tggcagactt gtctaattcc tgaaactcat tcatcccctt gagccagcca atggggagga 1080 

ataggataat gcaaacacat gttttgtttt ctcattttca aataatttac catgttaaaa 1140 

taaacttttc tttgtttttt atttgtagag tcagctaagt acccatattt aaatgccgtc 1200 

tttattattt ttttgaggtc tttgtttttg tctgtttttg ttttgttttg ttttgtaaat 1260 

aaggtaactg ggcaatcaaa caccttttgg ggattctggc tttagtattt tatcagccat 1320 

tttaaaatta aatataaaaa tcctttgtaa gaaacttgca tcctaatttt tctttattgc 1380 

aattgaaagt gtaaataata agacaatgta agtaagacct tcctaatgtc taatacaaac 1440 

tgggcttcag caagtggcct atttttatta gggttttgaa aggttgtgtg tgtgtgtgcg 1500 

tgccgtgtgt gtggttttct tttttaaatg gatagtagag tggtggctgg ataagggtac 1560 

ctgtaatggg ggtttggnca gcaagnctga aatttatact tttgnaaata aaactac 1617 



<210> 69 
<211> 1389 
<212> DNA 

<213> Homo sapiens _ . . 
<220> 

<221> SITE 
<222> (755) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1177) 

<223> n equals a,t,g, or c 



<400> 69 

gcttttttag gcattcattg gacacttgct ttaataagaa tagtttctta gttggcataa 60 

tgcttctgaa gtaatggtac tttaaaataa tttatctcat gaactttaag cattcctctt 120 

gaaatcttgt ttttactcta tctagtcagc "ccttataggc aatccaaagg gatgctttgg 180" 

atgctttagt ccagtggttc tcagagagtg gtctgtggag tcctggaagt cgctgagacc 240 

ctttcaggca atttgcaagc tcaaaactaa tttcagaatg acaccaggat gttctgtgcc 300 

ttttttgctg tgttggctat ttgcactgat gatgcaagag aaatggggag gagtaaaatc 360 

actggtgtct taccactatt caagacagtg gcaccaaact gtagtagtct agtaggcatt 420 

gtatcctgca cagcctcccg ccaggagtgt gatggaagac caaggaagca gtgtaaatgg 480 

aggtacacgg gaagcacttc ctttgcaccg tgaaggatga tgactgtttt aaggaaaagc 540 

acttatgcca tggtttgtgt cgtgagctga accagctgcc tttttcatgg aacatgactt 600 

_ ttacttgaaa, gag taactga caga aaacca attattctga cttgtg tttg ta gcagacat 660 

tttcttgaaa atgaagaagt gggtctgtga cttcaggaaa atgtctgaca gtatctgttg 720 

ccaaagaaaa tgtgagtttt caagccaaga atagnaatct agaaaacttg tattcmcctc 780 

catgggcttg atgmcctctt agtacttaga cctttctgac gagataagcg gtggcattaa 840 

caaatgtgac gtttttcatg ttatctaaat acatttgtca acatttrraa gatctgcaca 900 

actccctgga ccaggattty ccaaatgatt gttgcttttt gttacaaaat cagggaatag 960 

atagaagatt cattcaaata atgaaagata gactgatgga tttttatgta acagaatagg 1020 

aaaagtttat gacatgtttt cagattccac cttgcaacta atttttaaga agctaccact 1080 
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tgtcagccag gcacagtgct cacgcctata atcccaacac tttgggagtc caaggtgggc 1140 

agatcacttg aggtcaggag ttcaagcctg gccaacntgg tgaaagactt ctctactaaa 1200 

actacaaaaa ttagctgggc atggtggtgg gtacctgtaa tcccagctac tctggaggct 1260 

gaggcaggag aattgcctga gcctgggagg cagaggttgc agtgagccaa gatcgcgcca 1320 

ctgcacacca gcctgggcaa caagtgcgaa actctgtctc aaaaaaaaaa aaaaaaaaaa 13 80 

aaactcgag 1389 



<210> 70 

<211> 1896 

<212>^ DNA 

<213> Homo sapiens 



<220> 

<221> SITE 
<222> (1802) 

<223> n equals a,t,g, or c 



<220> 

<221> SITE 
<222> (1856) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1886) 
<223> n equals 



a , t , g , or c 



<400> 70 

aaaacaaaaa agctaataat ctcctcaagc aatttctggc ctaatagaat tatagtagac 60 

agtgaagtat ctaaacccag ggaatcagat tgaggcacca tgtccatcgc cttgagaatt 120 

aataggctgc atttctgggt tctccttttt tttttttttt ttgcccaact gagtctttct 180 

gtggacttac atggaacttc ttattctctt aaatcattaa gttacttgac aatattcttg 240 

gatttggaga aactggatgt agggccgtat gaaaaaatca ttcgaaatca gatttagggg 300 

tataaggttg gataggaatg ttttagaaag aagaatgtaa ggcagataac taatttgtca 360 

catccaaagt ataaaactgc tact ttt tec ctagaaaagg gaagctcatt ttaggcagee 420 

taaaccagta agattttctt cctcctccaa gtgcagattt ttgtaccttt cgtttgtcaa 480 

aacattcttt ggccctatgc atgccagagt gatatagaaa ggaagttacc acattttttt 540 

gagaacaaat cactcctgat aaaatttctt agacaattga taatcatttt aagaagaaat 600 

ttaattgtat ttagctctgt gtctcgcccc tttggtgtca ctcttctacc tcttccatca 660 

ctatagctaa atatttagaa gtatatcttg acacctagca caaatgtttt ggttaagtat 720 

cttaaaactg atggatggta tggctggggc agcatggctc acgcctgtaa tcccagcact 780 

ttgggaggcc aaggcgggtg aatcacctga ggtcaggagt ttgagacegg cctgaccaac 840 

ttggagaaac cccgtctyta ctaaaaatac aaaaattagt csggtggtgg cgcatgcctg 900 

taatcctgtc tactcaggar gctgargcag gagaattgee tgaacccggg argcarargt 960 

tgeartgage tgaratcgtg ccattgcact ccagcctggg caacaagagc aaaactcagt 1020 

ctcaaaaaac aaaacaaaaa acctgttggt atagtacgaa agaaaegtet tgcagttttc 1080 

tgttgcagag a attaattag aaccaacctg' ttggattata cacattcacc tttcagaatc 1140 

ctttcttctc tgtggaaacc cacactctcaT "gcagtgtgtg ggaacacagF^g^ttcttaa 1200" 

ggaatgcttg ttgaatgttg cagtctgeat cttcttgaag taacagaact gttggtagct 1260 

gtttaaaagt aaaatgtgtc taaagacctt ttggaaatta agatgtaaga gattaatgea 1320 

ccaaagcagt ctcttaatta cttaaaatga attatttcaa agaatcttta attgaatttt 1380 

ctgtgaagtc tggaatttgt aaattatgtc cctttgttca aaccagcccc tgaaaagaac 144 0 

aattaaggca attaagatag cattaaagtt ttcaatgaag ttggcatttt cygtgtatta 1500 

agattagatg ttagctgctg aagtttgtgg aggteggaca taaagcttcc aacatcagta 1560 



oucrwirv ~\Atr> nciATCArtu 1 i ^. 
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atgcaaaatt gtcttgaacc tgcgataaaa ttttgttgga cttttttttc attgcagtgr 1620 

aaagggccat gtagcatgcc tcaaagccag gttactcagc ctagtccttg tttaagcagt 1680 

tttgatattc atycaagttc aattttcyca cctgatttwa kgattaattt cctkggaaaa 1740 

attttgaaaa gttttccaaa gaaagtaaaa aatttaaata atccggtaac cccgtataat 1800 

angaggatta aaccttccag gttccaaatg gttttgggtg gcaattttcc cttccnaagg 1860 

tccctttttt ttcccaatgg gtaatnaaat aattaa 1896 



<210> 71 

<211> 308 

<212> DNA 

<213> Homo sapiens 



<400> 71 

ggcacgaggc ggcgctgcga ggacccatgc agctgacgct ggggggcgcg gccgtgggcg 60 

cgggcgccgt gctggccgcc agcctgctct gggcgtgcgc cgtgggcctc tacatggggc 120 

agctggagct ggacgtggag ctggtgcccg aggacgacgg gacggcctcc gcggaaggcc 180 

ctgatgaggc gggtcggccg ccacccgagt gagcgacacg gccgtggggc ctggcaggcg. 240 

ctggacagcg cccgaggact gggacattaa acctgacctc ccctcctcca aaaaaaaaaa 300 

aaaaaaaa 308 



<210> 72 

<211> 1688 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (912) 

<223> n equals a,t,g, or c 



<400> 72 

acccacgcgt ccgctcatgt ggacttatgc cagtctagag gcagaatcag aaggcttggt 60 

tgaacatatc gctttccctt tttcctctcc ctccgcccct cccagtacag tccatctttc 120 

aatgttgcag cctggttgag aaggagagaa aaaggtggca ggaatttcca ggagatcccc 180 

aagaatgctg ccttgtctgt ggacaaagat ggaccatgtg cccttcggaa ttagggatag 240 

aaacaaatat tgtgtgctct taacgattaa gctgtgttat ggtgggtttt caggttttta 300 

ccttttttct ttaccccttt actctgcaag aatggggaaa gaatgcatac tgcgaaaatg 3 60 

agtcttttaa attctgtctg"cctactagtt ttaagtTata't" ggtatgttgt aaaatttcca" -420 - 

atgatgagag acagcacaat aaatgtacct tatctcctta ggctgaaggc cataactaca 480 

tagtggagta atttaagaac tctcttgcct tcaccaaccc aaaaggttgc tttttgatag 540 

caactggcta atgaattttt aaaaagagaa gaaaaatact agttttcccc tcttttggga 600 

aatagatttt aaatggctaa actactagcc ttaaaactac tagtctataa atcaactacc 660 

acttttgtga atctgacagg ccacattttt atatggccct ttacagaatg gagtgtgttg 720 

aacaggatac taacgccatg gagttgagct gggcctagcg atggagggac actctaacac 780 

aactttccct cagctattat gcaacagatc agggaaaaag atgggatgac agatggggtc 840 

agacagaaag_agcttctgg g aaaca a gctt acatagt ctt ttttaaaatg^cacaaagcct 900 

cccagctaag angtcacttg gtttgggctt cattaggact ggagactttg ttggagttct 9~60 

ttctgggaac ttggagagtg gatgatattc aggctctgaa acattcccag cgctctcccg 1020 

agggtgccac tttctcaaga tgaaaactgt gactgaaaaa attaataata aatgtttctg 1080 

agctgcctgt gttctccctg tgtgggtgag agaagggact agactcctaa gcctgcctca 1140 

gatacaagag ggatcattgg ctccaatttt agagaacttg aaagcaaggc tttggacaaa 1200 

attttgagac cctaatcact ttaccttcct ccaaattacc caacatacgg taaacaacat 1260 

ttgtgcagaa gtatgtatgt atttagttca ggttgacttg tgtccttata aactgttact 1320 
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caaatgattt gaacttttat gcgactggga tttttttttt ccaaagctac aagcatggcc * 1380 

gcctgtggta tcgaggtgtt gcaaacaata tctgtgttgc gcttcctgtt ttaacctacc 1440 

tcgttttgtt tgtttttgtt tcactgttca tcacagcagt gttatctcca ggagacatat 1500 

agagagctca accggcaatc tcaggtgcat ttaacatttt taaaacgaaa cagtagttga 1560 

ccaaattttt cttcttaaaa aattggaagt ggggggaatc caatgacaaa aactaatgtg 1620 

gcttgtttct ggagaaaata attactgtaa atggaacaac aacaacaata aaacacacgt 1680 

taaacatc 1688 



<210> 73 

<211> 1138 

<212> DNA 

<213> Homo sapiens 



<400> 73 

gggcgcctgt agtcccagct attcaggagg ccgaggcagg agaattgcct gaactcagga 60 

ggcggasttg cagtgagccg agatcgcgcc attgcactcc agcctgggtg acagagtgag 12 0 

actctttctc ccaaaaaaaa aaaaaaaaaa aaagtcaaat gcagctggga atgtggttcg 180 

tgcctttttg tatattaacc atttgaaact tggttgtaag gtggggttgg caatgtcagg 240 

cctggctgca gcagctcatg tctttagagt gtgcctcttc cctctctcgt ggggctcgag 300 

caagactacc ttcatacatg ggctctccag ttacatagca actccagtgt taaattccat 360. 

cttttcttcc tggaaaagcc gtagaaagga cacctggaca tgcctgctgc acaggttgtc 420 

tgccttcccc atcagccsca gaaggaggaa ctttgctctc ttctctcaca gctgtgtgtg 480 

cataagaagt agttcggatg atgtgggtcc caccatgtat tccttctctg ttccatgtag 540 

agtaaaataa atgggagttc tgtttaatgc atcacctcgg ttcatafctgc atttgccaag 600 

aaagtgcaat tttattgaac a-ttaggattg aattcttaac tgagtaatca atttcagtag 660 

taagttaaaa tgccttctat taatggacaa ctgcaaccgt taatcagagt tacagtagat 720 

taacagttgt cagcatttat gctaatagca ctaataaacc gtgggctcat gatttgcact 780 

ttataattcc atatttctca aaacagttgg taatactttt tgcttgaagg tattgattct 840 

tttgtccctt tgcttgctac ttggagatgt agagaaagct aaatgacatt ttcacggtga 900 

tgacacaata tcaccttctg cttttgcaca cttggctttg tgtcaaaata gatggaaagg 960 

gttcatttgt tctggtgctc tactgtttaa tttgatctgg tgtgtgacta aagcaagaca 1020 

aatagtattt ttaatgaaac catttaataa cctctggtag cttagagtcg aaggcattgg 1080 

aaaaatgcaa ttaaaggatg cctagatgta aacaaaaaaa aaaaaaaagg gcggccgc 113 8 



<210> 74 
<211> 777 
<212> DNA 

<213> Homo sapiens" " ~ 
<220> 

<221> SITE 
<222> (761) 

<223> n equals a,t,g, or c 



<400> 74 

gtagcacctt gaaattgggc agttctgaatr atgct gttga gtaaajggaa_aatcac t a t c 60 
tttttaggac ctttggaatg tggctccatg catttgctaa cgttgttctc ttcagggctg " 120 

atttttctgg gctgttctac tcctctatcc ttctgtgatt gtcttccaat tcttttatta 180 

tggttagagt tccctgtaga aaccagtggg gtgtgtagtt aacaagtgtc aaaaggagta 240 

gaataattac tttatgtgat gtacttacga gaatactact tagtagaatc cagtataacc 300 

aaacaaaatg tagacgtatt taactatcca gtgtgtcagc tacacatttt ctcatcttta 3 60 

tcacttctgc tctggatatt gtgatctact ttcctatctc tttgtatttg tkttttcaca 420 

tttctttttt gtgtaaactg ctctatactc ttattgaaac ttgagcataa ttttatattt 480 
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acatagtaaa 
gccattgatk 
aacttactaa 
aacttcgwtt 
tgtcttccaa 



gtttctgata 
tttcatttta 
ctttaaatat 
tcttgatgat 
ggatttatgg 



tgattagaac ataagttgtw tctcctaatt ttccaataga 
acccctttta atggaacact tactagcttt ctaaattata 
acatgaatta tttctcagca ttcttaaaac aagggtactg 
tcaggggaaa agaattctga gtgttggaaa ggactttaga 
gatcaattta agaaaaaaaa naaaaaaggg cggccgc 



540 
600 
660 
720 
777 



<210> 75 

<211> 1060 

<212> DNA 

<213> Homo sapiens 

<400> 75 

gatgtatttc cttaatatgt agtttcagaa gtggaattta ttagagttaa actaaactca 60 

ttaaatttag agtttcttat tgtctttcat gagaacattt ttccttttca ttcataaatg 120 

atattgaaac actatattct tactttcata ttcctgttta tatttttgtt tttcatgtta 180 

aacattttac attctaatag taacctcatc gacctgttaa aaggcaatat aagatttaga 240 

ttattaaata gcatgtaata tatgtgatca gtaatcttca atgagcttgk tcttcattta 300 

attgcaacgt tatgtctgat tttttttgkt gcaaagcttt cagaatcttg acttgtggta 360 

atcttctttt aaaaaagctt ttaacagaat taatargtca tcacgttatg ataaatgatt 420 

aaggaaatga tgcctctaat acatkgaatt attaaaacta tcattttgaa aaattatatt 480 

ggtacaaact agtgtctact gctattactc atacatttca gaattcatac atggatatcg 540 

tctaggattt tttttttgcg taatcatgag ttacggtgtt aaagttatag tgttaattta 600 

attatgttat agtgttaatt tatctgtttt acatctcact tttgtatctg aaaccgttcg 660 

aaaataatta ttattaaagg ccagttgaca ..aaatttccac .tcctcctccc cagtgtgact 720. 

ttccttattt gtattatacc tataaagact acctcttaca tcggccaggc acagtggctc 780 

acgtctgtca tcccagcact ttgggaggac gaggtgggca gattgcctga gctcaggagt 840 

tggagaccag cctgggtaat atagtgagat cctgtctcta caaaatatac aaaattagct 900 

aggcgtgcct gtagtcccag ctacttggga ggctgaggtg gtaggatggc atagagtcca 960 

ggaggcagag gttgcagtga gctgagatgg tgccactgca ctccagcctc agtgtcagag 1020 

ccagtccctg tctcaaaaaa aaaaaaaaaa gggcggccgc 1060 



<210> 76 

<211> 1503 

<212> DNA 

<213> Homo sapiens 



<222> (6) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (18) 

<223> n equals a,t,g, or c 



<220> 

<221> SITE 
<222> (41) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 



<220> 
<221> 



SITE 
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<222> (1501) 

<223> n equals a,t,g, or c 



<400> 76 

gtggangccg ctcctganaa ctagtgggtc ccccgggctg ncaggattcg gcacgagaat 60 

gaatggcaaa gaaatagaag gggaagaaat tgaaatagtc ttagccaagc caccagacaa 120 

gaaaaggaaa gagcgccaag ctgctagaca ggcctccaga agcactgcgt atgaagatta 180 

ttactaccac cctcctcctc gcatgccacc tccaattaga ggtcggggtc gtggtggggg 240 

gagaggtgga tatggctacc ctccagatta ctacggctat gaagattact atgatgatta 3 00 

ctatggttat gattatcacg actatcgtgg aggctatgaa gatccctact acggctatga 360 

tgatggctat gcagtaagag gaagaggagg aggaagggga gggcgaggtg ctccaccacc 420 

accaaggggg aggggagcac cacctccaag aggtagagct ggctattcac agaggggggc 480 

acctttggga ccaccaagag gctctagggg tggcagaggg ggtcctgctc aacagcagag 540 

aggccgtggt tcccgtggat ctcggggcaa tcgtgggggc aatgtaggag gcaagagaaa 600 

ggcagatggg tacaaccagc ctgattccaa gcgtcgtcag ccaacaacca acagaactgg 660 

ggttcccaac ccatcgctca gcagccgctt cagcaaggtg gtgactattc tggtaactat 720 

ggttacaata atgacaacca ggaattttat caggatactt atgggcaaca gtggaagtag 780 

acaagtaagg gcttgaaaat gatactggca agatacgatt ggctctagat ctacattctt 840 

caaaaaaaaa aattggctta actgtttcat ctttaagtag cattttgctg ccatttgtat 900 

tgggctgaag aaatcactat tgtgtatata ctcaagtctt tttatttttc ctctztttcat 960 

aaatgctctt ggacattatt gggcttgcag agttccctta ttctggggat tacaatgctt 1020 

ttatcgtttc aggcttcatt ttagcttcaa aacaagctgg gcacactgtt aaatcatgat 1080 

tttgcagaac ctttggtttt ggacagtttc atttttttgg atttgggata gattacatag 1140 

gagtatggag tatgctgtaa ataaaaatac aagctagtgc tttgtcttag tagttttaag 1200 

aaattaaagc aaacaaattt aagttttctt .gtattgaaaa taaccta.tga ttgtatgttt 1260 

tgcattccta gaagtaggtt aactgtgttt- ttaaattgtt ataacttcac acctttttga 1320 

aatctgccct acaaaatttg tttggcttaa acgtcaaaag ccgtgacaat ttgttctttg 1380 

atgtgattgt atttccaatt tcttgttcat gtaagatttc aataaaacta aaaaatctat 1440 

tcaaaacaww aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 150 0 

naa 1503 



<210> 77 
<211> 872 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (844) 
<223> n equals a,t,g, 

<220> 

<221> SITE 
<222> (858) 
<223> n equals a,t,g, or c 



<40 0>_7-7 : 

ggggaagttc ttcactgcct tgcatttgac tccagatccc tccatcctcc cagagccttg 60 

gcctcaaaaa tgctgattct agcatcatgg aaatgctgtc ctcaaagtgg tctaaacggg 120 

ttgctgcttc acttgctcac ttaatctccc ttttcatagg gctgttgttt ttacttctgg 180 

gaagttctgt ttaccctgga acagaaactc tcttccctaa aagttgattt tattgaccca 240 

tggaggccag agacacttag gcatattttc cctccagact agaagcttct gaggaggacc 300 

tcctgagtct gcaccctggc tccctgctgt gctgagggcc cccgtgttaa cctcacgttg 3 60 

tgcctcctct gattcagagg gcccagtgtg gttctgtcag ccaggcagtg gccccagctc 420 



or c 
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tacagaaatg agttgtcatt gcatcctagg gccagggtct tcgtgcttgt gtgtgttacg 480 

tggaagtatg tggacaccaa gtgttcctgg atggccacag cctgcgaagg aaactggggc 540 

cagcagctgc tctgtgtttt cagccaacaa tggctcctgc ccactgccgc tgcataacca 600 

ccagaggcag gcttctcttg acacaggcct gtcgttggag catgtgcctg gcgagtccta 660 

tttctattcc cctgtgggtt agggacaggc agctgtacct tcagtgtgtt gctggggcag 720 

gagaatcgct tgaaccggga ggcggaggtt gcagtgagcc aaaattgcac cactgcactg 780 

cagtctgcag gacagagaga ggctmtatct caaaaaaaaa aaaaaaaaaa actcgagggg 840 

gggnccggga cccaattngc catataggaa aa 872 



<210> 78 

<211> 573 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (560) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (563) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (566) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (567) 

<223> n equals a,t,g, or e 
<220> 

<221> SITE 
<222> (571) 

<223> n equals a,t,g, or c 



<400> 78 

gatcaagttc cttagttttg acaatcaggt cccaaactct ttttcttgee tcatttattc 60 

attcaacaag tattttttgt gcactgaata tgtggccctc actaggcaga tgetgectat 120 

tettttgect gttaactaat ttaacctctt gtcatacctc ccaaatcacc ttatgetcca 180 

gagaaacttg tgtatggtca cgtaccacat aatgatgett tggtcaacta cagatagttg 240 

atccagtaag attaccatgg agctgaaaaa ttccaatggc ctagtattta ctgtgctttt 300 

aattattatt ttggaatgta ctccttttac ttataagaaa cttagctgta aaacagcctc 360 

agtc.at.t_t: t,c_ttcatga_ggt_a t ttca gaaa * at qqcattac^atca^aa^g^a^g^t_ 420 

teatgeattt tattgcccct gaagactttc cagtgggaca agatgtggag gtggaagaca 480 

gtgatattga tgatcctgac ectgeataaa tctagcttaa tatgtgtgtt tgtcttagtt 540 

gctgacaaaa aaaaaaaaan aanaannaaa naa 573 



<210> 79 
<211> 1509 
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<212> DNA 

<213> Homo sapiens 



<400> 79 

ggcacgagga tgtacctaat gagcttctcc attcactttg taaaaataat ttgtatgtgt 60 

accatcttgg tcctctcccc tcccgttttg tta.aaatatc aggatagcac tcccaggcca 120 

ctttggtctc agtgtaagat ccctattaac tatctgaaag gaaaatagag ccaagacctc 180 

tggtctcaaa tatataggaa ttgcctttct ttagtcttca ggactattgt gtgaaaacaa 2 40 

gtaggggtct aatctcctag aaggtagggg ctttatcctt aaagagaa.ta tgtccccaga 300 

ttattagcac ttttagagga gaagccaagg tatgtagggg tgtgtggctg gcccatcagt 3 60 

ggagcacgaa gagagaatgg gataccattg tgggaagaga agaaaagttc ctcaggggcc 420 

tcccactgct aaagtttttt gtgagatgtt gatctgtgct tcctggattt gacttttaaa 480 

ggaattattc tggcagcaca tgtagtattc ttggatgatc ttgctgctct tatttctcct 540 

tttgtgtgtg tgtgtgtgtg tgtgtggcta tgggttttca tttgtaactc catctgctta 600 

ggagagtggg ctctctataa gggaacctgc tgtaaacttc attgcagcaa ggatgtagag 660 

agaaatagga cttaattcca ctaggggctc tcatctcaca ccttaaggag gagatttcta 720 

gaaaaactgg gccagatttt ctttgttctc catcatttta atgtggcagg ctgttcagtt 780 

ttcttactct tacctatgtg atatttcttc gtaacgtgtc caaaaagaaa aaagacccaa 840 

tcagtgtctc ttgactttgt tctttgatcc ctcagtttct tcttgatttc agcatgtgtc 900 

gggttcctaa ttttgggtat gagttagcaa atttaaccat tgtgtttgtg ccctacccag 960 

gggactcccc agtttctgac ttgaagtaga ctgagaagaa tccacgaggt gctatctggc 1020 

cagatttaag tagattctat ttccttggtt ctccctctcc ctgaggacct cttattttat 1080 

tgtcccctct tctaggttaa ttctcctttg atttgacttt gttgagaagg aggttggaca 1140 

gtagattagc aaagttccaa gtgcaaaatt acagtgtgtt agagtgtggg gggaaaatta 1200 

gtcttatttt tccctacatg, ggatacaaca ctgigaattc aatcttcaac tgaaggccct . 1260 

gcagttctcc taaaacatag ttgtttgttt ttctttaaca aagtttaagc tagtgttaat 1320 

aaattaaaaa aaattgcttg tctgtctact tcagctttgt tttatgccca tttcatattg 1380 

ttgtctgtgt tgtaattcat aacttttgat accatttctg atgtgtaaaa ttggttgtct 1440 

tgtaaatatc ttataaagag ttcaattgta aataaactat tgtggctgtt aaaaaaaaaa 1500 

aaaaaaaaa 1509 



<210> 80 
<211> 1109 
<212> DNA 

<213> Homo sapiens 
<400> 80 



-CGaGgcgtcc-ggccgcagaa-cgggctccgc, ggacgacggg ctccagggac. gcacaggcag 60 

cgggcctccc accgcgggtg ccgggggcgg gggggctgcc cccatgcggg gcccttcctg 120 

gtcgcggcct cggccgctgc tgctgctgtt gctgctgctg tcgccttggc ctgtctgggc . 180 

ccaagtgtcg gccagggcct cgccctcggg gtccctgggc gccccggact gccccgaggt 240 

gtgcacgtgc gtgccgggag gcctgccagc tgtcggcact ctcgctgccc gccgtgcccc 300 

cgggcctgag cctgcgcctg cgcgcgctgc tgctggacca caaccgcgtc cgtgcgctgc 360 

cgccaggtgc cttcgcggga gcgggcgcgc tacagcgcct ggacctgcgc gagaacgggc 420 

tgcactcggt gcatgtgcga gccttctggg gcctgggcgc gctgcagctg ctggacctga 480 

gcgccaacca gctggaagca ctggcaccag ggactttcgc gccgctgcgc gcgctgcgca 54 0 

~acctctcatt~ggccggcaac-cggctggcgc gccfcggagcccgeggcgcta-ggcgcgctcc 60.0_ 

cgctgctgcg ctcactcagc ctgcaggaca acgagctggc ggcactcgcg ccggggctgc 660 

tgggccgcct gcccgctcta gacgcgctgc acctgcgcgg caacccttgg ggctgcgggt 720 

gcgcgctgcg cccgctctgc gcctggctgc gccggcaccc gctgcccgcg tcagaggccg 780 

agacggtgct ctgcgtgtgg ccgggacgcc tgacgctcag ccccctgact gccttttccg 840 

acgccgcctt tagccattgc gcgcagccgc tcgccctgcg ggacctggcc cgtggtttac 900 

acgctcgggc cggcctcctt cctcgtcagc ctggcttcct gcctggcgct gggctctggg 960 

ctcaccgcct gccgtgcgcg ccgccgccgc ctccgcaccg ccgccctccg cccgccgaga 1020 
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ccgtccagac ccgaaccccg atcccgaccc ccacggctgt gcctcgcccg cggacccggg 1080 
gagccccgtc cgctgccgcc caagcctga 1109 



<210> 81 
<211> 807 
<212> DNA 

<213> Homo sapiens 



<400> 81 

cccacgcgtc cggacgtcct gatagatcct ctgctccaat aggcaactcc ggccttcccc 60 

gccctgacct ggaacctctg ggagggctgc agagtaagtg ccgcctctgc gctccgacgg 120 

aggcacgagg cctgtggagt aggtccctct gttccgacag gtgcgacact tggcgctcca 180 

tgcttgcggg tgccgggagg cctggcctcc cccagggccg ccacctctgc tggttgctct 240 

gtgctttcac cttaaagctc tgccaagcag aggctcccgt gcaggaagag aagctgtcag 3 00 

caagcacctc aaatttgcca tgctggctgg tggaagagtt ' tgtggtagca gaagagtgct 360 

ctccatgctc taatttccgg gctaaaacta cccctgagtg tggtcccaca ggatatgtag 420 

agaaaatcac atgcagctca tctaagagaa atgagttcaa aagctgccgg ttcagctttg .480 

aatggaacaa cgcttatttt ggaagttcga aaggggctgt cgtgtgtgtg gccctgatct 540 

tcgcttgtct tgtcatcatt cgtcagcgac aattggacag aaaggctctg gaaaaggtcc- 600 

ggaagcaaat cgagtccata tagctacatt ccacccttgt atcctgggtc ttagagaccc 660 

tatctcagac agtgaaagtg aaatggactg atttgcactc ttggttcttt ggagccttgt 720 

ggtggaatcc ccttttcccc atcttcttct ttcagatcat taatgagcag aataaaaaga 780 

gtaaaatggt aaaaaaaaaa aaaaaaa 807 



<210> 82 
<211> 1043 
<212> DNA 

<213> Homo sapiens 



<400> 82 

ggcacgagtt gggccgggca cccccagaag ctgaccttga gacaaggatt tgggtgcaag 60 
tggtttattt ggcaggtgcc cagaaagtgc tgacaggagt gggaaagtga gttaggggag 120 

agaaggaagc cactacaggc tatgttcatg tgcaggttac tgctgtgggc aactggggct 180 
tacggatttc taggagatga cgtggaatac acctcagtgt tgccccacca gaagggcaag " 240 

gaagcatggg tatttatatg tcagctccca ttcattattg gctgagggca gctcctagag 300 

ggcattgggt ctgcgtttca agcctgctgc acataggctg agaggaatcc ctgagttcga 3 60 

gtcacaggcg cccacagtca tgctcagaca gcacatacag gaacagtgac tgcagggggc 420 

ataggtggga cacaaatacc accagttata aagaggaaag atgggaagga aagacaagag 480~ 

gaaggtgtgg agttagattc ctgggtcaga tgtgaacccc tggctctcaa aacactcctt 540 

ctttttttct ttttcttttt ttttgagaca ggatctcact ctgttgcaca ggctagagtt 600 

cagtggtgta atcagggctc gtggcagcct ctacctccta ggctcacatg atcctcccac 660 

ctcagcctcc tgagtagctg ggactagagg cacacatcac cacacttggc tagtatttaa 720 

atttttctgt agaagtccag gcgcagtggc tcatgcctgt aatcccagca .ctttgggagg 7 80 

ccgaggcagg tggatcacct gaggtcagga gttcaagacc agcctggcca acatggtgaa 840 

accccgcctc tactaaaaat acaaaaaaat tagcctggtg tcgtggcagg ctcctgtaat 900 

cctggctcct tgggaggctg aggtagga ga atcacttgta cccag aatgt ggagcttgca 960 

gtgagctgag atcatgccat tacactccag cctgggcaag~aagagtgaaa ctccatcgca 1~0~20~ 

aaacaaaaaa aaaaaaaaaa aaa 1043 



<210> 83 
<211> 1173 
<212> DNA 
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<213> Homo sapiens 
<220> 

<221> SITE 
<222> (548) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (603) 

<223> n equals a,t,g, or c 



<400> 83 

gctgtctcag aaaaaagaaa aaagtttcta aagtaaaaat tgaaagtact tcccctacaa 60 

ccacaggttg ctttgacaga ttaatgtaaa ttcttccaga tactcttctg tggatgtaga 12 0 

aacatgcaga atgaggcaag ctttaatttg cttatgtcac ttactgtgga tagcctttca 180 

tatcttataa gttaatgtca gagcagcaat ctcatttttt tccaatttgt aaacatttta 240 

tttaacctta tgatggatat tttggtggat ttcagtatta caaaaatgcc tattaatagt 300 

atattttcat tatatttctg ttacgaaatt ataatgctac aaacattact atgcctgtgg 3 60 

cagtatacat ctgcacaagt tttgaaaatg ttatgcattc ataggcaaaa atgggataac 420 

ttttgggcag tggtcatgat taatctgttg atcagaatcc agagattgcc cttctccttg .480 

ccaattgctt taagagtacm ctagtttttg gccgggtgca rtggctcatg cctgtatccc 540 

agcatttngg aggccaagac gggcggatca caaggtcagg agatcgggac catcctggct 600 

aanttggtga ggccccattc tactaaaaat tccaaaaaaa cccacmaaaa ccaaaaaaac 660 

ccrgccttgk tggtgggatt acaggcatgt gcmacaacac ccggctaatt ttttttgtat 72 0 

ttttagtaga* ggtggggtgt caccatgttg gccaggctga tctcaaactc ctggccttaa 780 

. gtgatctgcc cacctcggac tcccaaagtg cggaattaca ggcgtgagcc accgcgcccg 840 

gccactggtt tttaaacttt attttgaaat tatttcaggc tgggcgcagt ggttcacgcc 900 

tgtgatccca acactttggg aggccgaggc gggcggatca cgaggtcagg agatcaagac 9 60 

catcctggct aaccccgtct ctactaaaaa tataaaaaat cagccgggca cggtggcagg 1020 

tgcctgtagt cccagctact cagtgggctg aggcaggaga atggtatgaa cccgggaggc 1080 

ggagcttgca gtgagctgag atcacgccac tgcactccag cctgggagac agagtgagac 1140 

tctgtctcaa aaaaaaaaaa aaaaaaactc gag 1173 



<210> 84 

<211> 1561 

<212> DNA 

<213> Homo sapiens 



<400> 84 

ggcacgagtg aggctcatgt ctgacctgca gaactgtata atgataaatt atgttgtttg . 60 

aaaccgctac atttgcggta atttgttaca gcagcaatag aaaacgaatc ccctgcccag 120 

aatgacttcc tcctttcctg tcggacgaag gctcaggcct tctgctgaaa gcttgctccc 180 

ctaagtagtc acacccaatg ccgaatactc cccagaagca gctgctattt tctgaggaca 240 

atgagttgct tgtaagcctg agaacaggac gaaaacccac tttgcaagca gccctgcgtg 300 

tgacgggcat gccctcggag ggcaggttgg ttttgctatc tgctttctgt cc tgcctttt 360 

-toc^c^caJig^ggtP^ 4.2 OL 

actcttctca caggagaata gctgtatgga cgtagcaggg ggagttacca catgcctacc . 480 

tccatggttt tcgagagggg cccctgccca aatgtctcag tggccacctt catcagacca 540 

tggagcagtc agagcgggaa gggattctag agttggtcca gtccaaccat ctcatcttac 600 

atgtgaagga ggaaaggaag aaagggagaa aaataagaaa gctgaggtca accctcctac 660 

agggatgggc ctggccaaca ggatcccaag ggatgacata acattgaaat taagaaacca 720 

aggaaagttg agaactaaag aaaacagaac ccagtcagcc aagaggcatc cttgagggcc 780 

aaccaagccg tcaaacctgg atgcccccga cgagtcagaa agtcgggtgc ctcaagagcc 840 
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aagcagccaa gaaatggggt ctggaactgg cactttggtc cgcctctgtg cactcaccca 
gaaagggtgg aagggacc'ct gggaccaagt gccaaggtca cacaacggat gaatagactg 
ctggacttca aactgaacat gccattttgc caaagcagtc atcaccttcc gtgaatcata 
aatgtttgtt caaagccaca aatgtatata ctctttgtat gtatacagat tttttctaaa 
ggttaacatc taaacagatc aattaaggtc agccttaatt tgtctgagct ttttggttaa 
agtttcctga gtaattgagc gaattcaagt ttctggcttt ctcctttctc tttctccatt 
taaaacatga tctcatgaaa tttttgtccc aagaaaggca ggattacatt ttcttttaac 
agtttgagtt ggtgtagtgt attcttggtt atcagaatac tcatatagct ttgggatttt 
gaattggtaa atattcatga tgtgtgaaaa atcatgatac atactgtaca atctcagtgc 
cacaaaattg gatgttgtgc ctacacacgc acaggaccta gaagagcatg tcaaactata 
aactgcctgt gattgtgaat gactttgttc tttgcttctt gcgtttttca gtttcctata 
atgcacatct taacttttaa aaaataaagg. ttattttaaa agccaaaaaa aaaaaaaaaa 
a 



<210> 85 

<211> 1433 

<212> DNA 

<213> Homo sapiens 



<400> 85 

cccggagccg tggacgccct acagctgaga aggggaccca aggggtcggc cgcggccaag 60 

gcccctagga ccgccgcccc agctcacgct gccgacggca ttatkagaca ttctgcgtca 120 

ggtccgggct cctggacttc gcctttcccg agccctggag gtggggagaa aaggttcacc 180 

aatttttaaa atccaaatat atctcatggt acagtggaag aactggccag agagtctgga 240 

ag.tttgggtt ctggtcctgg : ctgtgccact gactcactgt gaccttggga tcttgtgctg 3 00 

tgaagacatt tcccaagtgc ttcatgttag ccagcaaatc tgacccacaa ggcctggaaa 3 60 

gaggtgattg ttaggttgcg cagaggtggt cttatccagc tcagcttccc ctgggaccca 42 0 

ccgtgggacc tgaggcagaa ctggggtgga cttggcctcc tccatggcac accggctgca 480 

gatacgactg ctgacgtggg atgtgaagga cacgctgctc aggctccgcc accccttagg 540 

ggaggcctat gccaccaagg cccgggccca tgggctggag gtggagccct cagccctgga 600 

acaaggcttc aggcaggcat acagggctca gagccacagc ttccccaact acggcctgag 660 

ccacggccta acctcccgcc agtggtggct ggatgtggtc ctgcagacct tccacctggc 720 

gggtgtccag gatgctcagg ctgtagcccc catcgctgaa cagctttata aagacttcag 7 80 

. ccacccctgc acctggcagg tgttggatgg ggctgaggac accctgaggg agtgccgcac 840 

acggggtctg agactggcag tgatctccaa ctttgaccga cggctagagg gcatcctggr 900 

gggccttggc ctgcgtgaac acttcgactt tgtgctgacc tccgaggctg ctggctggcc 960 

caagccggac ccccgcattt tccaggaggc cttgcggctt gctcatatgg aaccagtagt 1020 

ggcagcccat gttggggata attacctctg cgattaccag gggcctcggg ctgtgggcat 1080 

gcacagcttc "ctggtggttg gcccacaggc actggacccc gtggtcaggg 'attctgtacc 1140" 

taaagaacac atcctcccct ctctggccca tctcctgcct gcccttgact gcctagaggg 1200 

ctcaactcca gggctttgag gccagtgagg gaagtggctg gccctaggcc atggagaaaa 1260 

ccttaaacaa accctggaga cagggagccc cttctttctc cacagctctg gacctttccc 1320 

cctctcctgc ggcctttgtc acctactgtg ataataaagc agtgagtgct gagctctcac 1380 

ccttccccca ctaaaaaaaa aaaaaaaaaa actcgagggg gggcccggta ccc 1433 



<.210> 86 : 

<211> 1377 

<212> DNA 

<213> Homo sapiens 

<400> 86 

ggcacgaggt ccagtcctga ttccatcttc ttacaagtta gggagctggg tccaggcctg 
gatccatgtt. attatgaatc aggaagttgg gtccaggcct ggctccatgt tcctgcaggt 



60 
120 
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cagggcagg.t cttcccccga gtgatggctc ttggactgtg ctcctctggg gccctctcaa 180 
ctctgtgtct gtcatctgtc acctgcctgg ccattatggt tttgatggca gtggatgggc 240 
tccatgggac ttcaggcctg gggtgagact caggaccctg gggtgggcat ggatggggat 300 
attggacccc tgaaagaagg gaagctgaga gacttttttc ctttaaagac ttttccatgt 360 
tatctccact cagagaattc ttttctgcaa agtcacggga gggaggtgac attgagccct 420 
ccaatgtgac agaaactgtg ctgggaactt tacatgtgtt acctaatttg tttaattatc 480 
ccagcaactc cacaaagtag gcatttttat tgttgaggaa acagaagctt agagactt.tg 540 
tgagacttgc ccgagacccc aggtcacaca ccagcaagga tgaggtcaag cttttaatcc 600 
aggtctgcct ggctccaagt ccacaccctt tcacaacaat gaactttctt tatgattgca 660 
gatattattt ggggaacttt acatcaaaca ttgactacat aaaacttcaa ccatagacta 720 

tattctttgt tttggaaact gtgaagactc aaatttttta taaactcaga acagcttcca 780 

gttttctcta gatatcggaa gatgggctgt gttttttgtc tgttgtccag tgaggctgat 840 

ttgtagtcag acaggtgagt cagtttggtt ggagtaggct attgtggttc tctctcatca 900 

ggaaagaggg gatgcacttg gcccctcaac tccaagttgg tggtgcgatg atttttccat 960 

attctccctt aacaggctgt gagggagtct gggccaggca ctaggccatg agcagggcag 1020 

actggggtaa acccttagcg agcctctctc cagccacgag gaaacctgga gtgtgtgcgt 1080 

gcctgtgtgc tgctggtgtg tgtgtgtgaa tgcacacgtg tgtgcatgca ctgtgagctg 1140 

gtgtgtgcat gtgcactggt gtgtgcgttt gtgtgtgtgt gtgtgtgtgc atgtgtgtgc 12 00 

tgggtgcaca catgcatatg tctctgtgta tacatgtgta tgtgtgccag tgggtgcatg 1260 

tgtttgtaca gtgtgcgtgt gtgtgtgtgt ttgtgcacat gagctgctgc acacatataa 1320 

gccttgtgaa ttaggggaag aagaaaggct ccggcttaca aaaaaaaaaa aaaaaaa 1377 



<210> 87 
<211> 1715 
<212> DNA 

<213> Homo sapiens 



<400> 87 

ggcacgaggg acattggagc tccccacacc actcattgct gcccaccagc tatacaacta 60 

cgtggctgat cacgccagct cttaccacat gaagccattg cgaatggccc ggccaggggg 120 

cccagaacac aacgagtatg ccctggtgtc ggcatggcac agttctggct cctacctgga 180 

ctctgaggga cttcgacacc aggatgactt tgatgtgtct ctgcttgtct gtcactgtgc 240 

tgcacccttt gaggagcaag gagaggctga gcggcacgtt ctgcggctac agttcttcgt 300 

ggtgctcacc agccagcgag agctcttccc caggctcact gctgacatgc gccgcttccg 3 60 

gaagccaccc agactgcccc ctgagccaga ggctcctggg agttcagctg gcagccctgg 420 

ggaggcctca gggcttattc tagcgcctgg accggctcct ctgttcccac cactggctgc 480 

agaggtgggc atggcacgag cacggctggc tcagctggtg cggctggctg gagggcactg 540 

ccgtcgggac accctttgga agcgcctctt cttgctggag ccaccggggc ctgatcgact 600 

-gcggctaggg gggcgcctgg ccctggcaga gctggaggaa ctcctagaag cagtccatgc 660 

caaatccatt ggggacatcg acccccagct ggactgcttc ctatccatga cggtctcctg 720 

gtaccagagc ctgatcaaag ttctcctaag ccgcttcccc agagctgtcg ccatttccaa 780 

agcccagact tgggaactca gtacctggtt gcgctgaatc agaagttcac tgactgctct 840 

gcgctagtgt tctggactcc acttaggaaa gacgtctctg aagtggtttt ccgagaagcc 900 

cttccagtac agccccagga cacgagaagc ccccctgccc aactggtctc cacctaccac 960 

cacctggagt ctgtcatcaa cacagcctgt ttcacccttc tggacccgcc tcctctgaag 1020 

ggagtggact ggaccactga atgtcactgt tccttgaatc atgggcctac cagattgcct 1080 

gccagaggca ggactgacca gcccttctgg gccccagggc aagccagaca ctgagtgaca 1 140 

ccaaaggctt~tgtaa"ct'atg — tcttgagggtf ctgctgcccc agcctggcag caggaaccgc 1200 

cctccccaaa cacccacagc cactgaccca tccaggactc cagagagtca ggtcaacccc 1260 

gaggacccct tgggcccttc tggggtactc ctttcggccc ccctggtaga gtctcgggag 1320 

ttcacacagg gtggcaaaca ccccctagag ctcctctgcc tgaatcctgc cccctagcct 1380 

ttgaccactg tcagccacct gtgtcccttg agccttcggg tcttcacttc. ccacttggac 1440 

atcactgctg gacattccca tcgagatgac acctgggttc caatcccagc tctgcctttg 1500 

aagcacttgc ggccaccgtc aagtcccttt gctctcggac cctgggtttc tcatccttta 1560 
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atgaggtggg ttcagaagct ctcccatctt cacagcaacc ctggcactgg cttctcaatg 1620 
ggagggaagt cagcagagaa actgaagtgt tagacactat gtgtcccacc accccattac 1680 
agagacatat gacaatgaaa aaaaaaaaaa aaaaa 1715 



<210> 88 
<211> 417 
<212> DNA 

<213> Homo sapiens 
<400> 88 

ccacgcgtcc gctcctctag aggctccaca tgaagtccca gtgctacagt cctagttatt 
ttgccttctt ctgcctggtt ttctttcaga tcacctcagc cagttctcag acacttaggg 
gacatgttct ctgcaggacc actctgaggg actcttctgc atattgctga cctgagagga 
tggcctcaga gctgacttgg gcaatcctcc ccaacaggaa ggggagacat tgcctgccac 
tgaggaaaca ggtcatgaag gtggagataa gctgcaaggg gcgaagcaac tttatgtcag 
tggaaaacgt gtctctttaa agctgctatg tgaacagctt ttacagtcat taaatttacc 
taaactaagg ttaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaa 



<210> 89 
<211> 1167 
<212> DNA 

.<213> Homo sapiens 
<220> 

<221> SITE 
<222> (432) 

<223> n equals a,t,g, or c 
<400> 89 

gggggtgggg caggcgacgg tggggaagat ggcgtaccag agcttgcggc tggagtacct 
gcagatccca ccggtcagcc gcgcctacac cactgcctgc gtcctcacca ccgccgccgt 
gcagttggaa ttgatcacac cttttcagtt gtacttcaat cctgaattaa tctttaaaca 
ctttcaaata tggagattaa tcaccaactt cttatttttt gggccagttg gattcaattt 
tttatttaac atgatttttc tatatcgtta ctgtcgaatg ctagaagaag gctctttccg 
aggtcggaca gcagactttg tatttatgtt cctttttggt ggattcttaa tgaccctttt 
tggtctgttt gtgagcttag ttttcttggg ccaggccttt acaataatgc tcgtctatgt 
gtggagccga angaacccct atgtccgcat gaacttcttc ggccttctca acttccaggc 



60 
120 
180 
240 
300 
360 
417 



60 
120 
180 
240 
300 
360 
420 
480 



"cccctttctg ccctgggtgc tcatgggattT ttccttgttg ttggggaact caatcattgt 540 
ggaccttttg ggtattgcag ttggacacat atattttttc ttggaagatg tatttcccaa 
tcaacctggt ggaataagaa ttctgaaaac accatctatt ttgaaagcta tttttgatac 
accagatgag gatccaaatt acaatccact acctgaggaa cggccaggag gcttcgcctg 
gggtgagggc cagcggcttg gaggttaaag cagcagtgcc aataatgaga cccagctggg 780 
aaggactcgg tgatacccac tgggatcttt tatcctttgt tgcaaaagtg tggacacttt 
tgacagcttg gcagatttta actccagaag cactttatga aatggtacac tgactaatcc 
agaagacatt tccaacagtt tgccagtggt tcctcactac actggtactg aaagtgtaat 
_ttc.ttagagc _caraaaactg_gagaaaca*aa_tatcctgcca_cc.tc.taacaa_gtacatgagt — 
acttgatttt tatggtataa gcagagcctt ttcttcctct tcttgataga tgaggccatg 
gtgtaaatgg aagtttcaga gaggacaaaa taaaacggaa ttccattttt ctctcactgt 1140 
aaaaaaaaaa aaaaaaaggg cggccgc 1167 



600 
660 
720 



840 
900 
960 
1020_ 
1080 
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<212> DNA 

<213> Homo sapiens 



<400> 90 

ccacgcgtcc 

ccttccctca 

gactcaggcc 

actgctgact 

ccggaatccc 

tgctgtctgc 

ctaccgttgc 

tattctctca 

gatgacctcc 

gcctgagagg 

ccaggagcaa 

acaagaacac 

ggagggaaag 

gcagacagac 
tgctccccgg 
cattcgatca 
.gagaaaccaa 
tgctattcga 
atggaggagg 
.atgtctacct 
gccagcctgc 
ttgcctccca 
acgggctgga 
aaggctgtga 
atggggattc 
cttcaaaagc 
tctgcagaat 
atggagccag 
tgcccacacc 
aggctgcccc 
cagtcccaga 
tgttgatctt 



gcgggaccgg 
ctcctgaagg 
tccactccag 
ccaacctgga 
acactcgtcc 
tccaacctcc 
tccaaccacg 
cctaacactc 
cccatctcac 
ctcagcaaca 
gcgccagagc 
aagcaggaag 
caggaagaag 
tcagagccca 
gtacgagaag 
gccqaggaaa 
aaccctggca 
tcgtggagaa 
agatccttgg 
gtgccctctg 
agcggcaaca 
gagcctgtcc 
tttgtacggt 
agatgtccga 
cctaccaaga 
cagcagtgtc 
gagacttaca 
gagttcagca 
ccagcccaac 
ttctgggtct 
gagggccatg 
caaaaaaaaa 



acggatcttc 
tgctgctcct 
gcagccctct 
aggcagagac 
agctggacca 
cttatgcctc 
tctactatgc 
tcaaggagat 
cccacttcac 
acgtggaaga 
acaagcagga 
aggggcagaa 
gacaggggac 
agtttcactc 
tagagtctac 
tagatgaaat 
gcctcctgca 
tacctgcatc 
tttcgggaag 
tgacttctgc 
atgcgacaec 
atcggcaacc 
gggctccaca 
gtctctgggt 
tttgtgacac 
tgatgagaaa 
gtgcgctgag 
ccttgactct 
ctgcccacgt 
gttactcggc 
gtgggagtgc 
aaaaaaaaaa 



tccggccatg 

gcctctggca 

ctctcctacc 

tacctgccgt 

atatgaaaac 

ctggtttgag 

caagagagtc 

agaagcttca 

agtgacagaa 

gctcctacaa 

gcaaggagtg 

acaggaagag 

taaggaggga 

tgaatctcta 

tcctatgata 

gaatgaaata 

gctgccccac 

ataaccccca 

tcggtctgtg 

tccttgaagc 

.tcccacaaga 

aggtagggtc 

tggacttctg 

ggctccagac' 

agactatatc 

ccgcaatcgg 

ccctggcaaa 

aggccagttc 

tctctattgt 

ccctactcac 

gccctcctta 

aa 



aggaagccag 
cctgccgcag 
gaatacgaac 
ctccgtgcaa 
cacggcttag 
tctttctgcc 
ctgtgttccc 
gctgaagtct 
cgccagacct 
tcctccttgt 
gagcacaggc 
caagaagagg 
cgggaggctg 
tcttctaacc 
atggagaaca 
tatgatgaga 
acagagcctt 
cagccaaggc 
acagccttgg 
tggagcagtg 
ctccctttgc 
cccagaatca 
gtgtgcccgg 
tgagttcctt 
cagtacccaa 
aaggtgtccc 
agtgaggacg 
ggatgagctg 
tttgagaccc 
atttccttgg 
aaagatgact 



ccgctggctt 
cccaggattc 
gcttcttcgc 
cccacggctg 
tgcccgatgg 
agttcactca 
agccagtctc 
cacccaccac 
tccagccctg 
ccctgggaag 
aggagccgac 
aacaggaaga 
tgtctcagct 
cttcctcttt 
tccaggagct 
actcctactg 
gctggtgctg 
ctggaagtac 
gcggcgacac 
ccactcagag 
agccccttgc 
ggccgctttt 
cttgccacga 
agcttccagg 
actactgttc 
gcatgagatg 
ttgtgcttcg 
gcgtctattc 
cattgctttc 
gttggagcaa 
ttacataaaa 



60 
120 
• 180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1892 



<210> 91 

<211> 523 

<212> DNA 

<213> Homo sapiens 



<400> 91 
cacagcaaag 
gtgcgagtgc 
ctgtgggatg 
_ggc.tggaagg 
cccaacgtct 
tgctggctcc 
atccagcaga 
gagctaatgg 
ggccagtttg 



caagttctaa 
cccttgaagg 
gatgggctgg 
.tgcat.ttctc 
cctcggcagg 
gtgggtgctg 
tgtggtaggt 
tccccagatg 
gtggcatcct 



gagccaagct 
aggttttcta 
cacttgatgg 
_agac„ttcctt^ 
cttggtggtg 
agaggaaggg 
cctggccgct 
ctccaccgtc 
gggaaccgtt 



tcagaccaat 
acaggtgagt 
ctctccttcc 
_gcctgggaaa. 
gacgtgctgg 
ggaaggctgt 
tgctgacccg 
ctgatgtggc 
ttggaggctc 



cccccaccgt 
ggtctgattc 
ccctcacccy 
_t.ggga.ag_t.ga_ 
gccatgttcc 
ctattttttg 
tgggtctacc 
agaggcatgg 
gag 



gaagtccccc 
tgtctctgtc 
ccacggagaa 
_t gcagagga t 
aggggccagc 
gccaggatga 
gggtgctccg 
cattttggcg 



60 
120 
180 
240 



300 
360 
420 
480 
523 
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<210> 92 

<211> 1382 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (1382) 

<223> n equals a,t,g, or c 



<400> 92 

gccggctggc agcacgactc gcgtagccgt gcgccgattg cctctcggcc tgggcaatgg 60 

tcccggctgc cggtcgacga ccgccccgcg tcatgcggct cctcggctgg tggcaagtat 120 

tgctgtgggt gctgggactt cccgtccgcg gcgtggaggt tgcagaggaa agtggtcgct 180 

tatggtcaga ggagcagcct gctcaccctc tccaggtggg ggctgtgtac ctgggtgagg 240 

aggagctcct gcatgacccg atgggccagg acagggcagc agaagaggcc aatgcggtgc 3 00 

tggggctgga cacccaaggc gatcacatgg tgatgctgtc tgtgattcct ggggaagctg 3 60 

aggacaaagt gagttcagag cctagcggcg tcacctgtgg tgctggagga gcggaggact 420 

caaggtgcaa cgtccgagag agccttttct ctctggatgg cgctggagca cacttccctg 480 

acagagaaga ggagtattac acagagccag aagtggcgga atctgacgca gccccgacag 540 

aggactccaa taacactgaa agtctgaaat ccccaaaggt gaactgtgag gagagaaaca 600 

ttacaggatt agaaaatttc actctgaaaa ttttaaatat gtcacaggac cttatggatt 660 

ttctgaaccc aaacggtagt gactgtactc tagtcctgtt ttacaccccg tggtgccgct 720 

tttctgccag tttggcccct cactttaact ctctgccccg ggcatttcca gctcttcact 780 

ttttggcact ggatgcatct cagcac'agca- gcctttctac caggtttggc accgtagctg 840 

ttcctaatat tttattattt caaggagcta aaccaatggc cagatttaat catacagatc 900 

gaacactgga aacactgaaa atcttcattt ttaatcagac aggtatagaa gccaagaaga 960 

atgtggtggt aactcaagcc gaccaaatag gccctcttcc cagcactttg ataaaaagtg 1020 

tggactggtt gcttgtattt tccttattct ttttaattag ttttattatg tatgctacca 1080 

ttcgaactga gagtattcgg tggctaattc caggacaaga gcaggaacat gtggagtagt 1140 

gatggtctga aagaagttgg aaagaggaac ttcaatcctt cgtttcagaa attagtgcta 1200 

cagtttcata cattttctcc agtgacgtgt tgacttgaaa cttcaggcag attaaaagaa 1260 

tcatttgttg aacaactgaa tgtataaaaa aattataaac tggtgtttta actagtattg 1320 

caataagcaa atgcaaaaat attcaataga aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1380 

an 1382 



<210> 93 
<211> 1747 

-<212>'DNA " " -■ — 

<213> Homo sapiens 

<400> 93 

ccacgcgtcc ggctacctgt gcatcgtgct gctcatgctg ctgctgctca tcttctggat 60 

cgcgccggcc catgggccca ccaacatcat ggtctacatc agcatctgct ccttgctggg 120 

cagtttcacc gtgccttcca ccaagggcat cgggctggcg gcccaagaca tcttgcataa 180 

caacccgtcc agtcagagag ccctctgcct gtgcctggta ctcctggccg tgctcggctg 240 

cagcatcatc gtc c agttc a ggta catcaa ^aaggcgctg_gagtgcttcg_ actcct egg t 300 

gtteggggee atctactacg tcgtgtttac cacgctggtc ctgctggcct cagccatcct 3 60 

ettcegggag tggagcaacg tgggcctggt ggacttcttg gggatggcct gtggattcac 420 

gaccgtctcc gtggggattg tccttataca ggtgttcaaa gagttcaatt tcaaccttgg 480 

ggagatgaac aaatctaata tgaaaacaga etagattgea ataggagctt ggatggttcg 540 

aggaataggc attggaggtg gtttctggcc gtgattggat gtgaagtaga agaggtcctc 600 

gatcatggtg ttagaattga ctggatagta acaggtggtc tggtggatag eggggagcat 660 

ggctcagcac cagagcagag gcccagcagc ctctgcagcc caaacgtccc aacggtgcct 720 
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ggaccatctc ttctgatgag acgaatctca ttttcatttc cattaacctg gaagctttca 780 

tgaatatttc ttctttaaaa cattttaaca ttatttaaac agaaaaagat gggctctttc 840 

tgggtaggtg gtacatgata gcagagatat ttttacttag attactttgg gaatgagaga 900 

ttgtgtcttg aactctgcac tgtacaggat gtgtctgtag ttgtgttagt ttgcattaag 960 

catgtataca ttcaagtatg tcatccaaat aagaggcata tcattgaatt gtttttaatc 1020 

ctctgacaag ttgactcttc gacccccacc cccacccaag acattttaat agtaaataga 1080 

gagagagaga agagttaatg aacatgaggt agtgttccac tggcaggatg acttttcaat 1140 

agctcaaatc aatttcagtg cctttatcac ttgaattatt aacttaattt gactcttaat 1200 

gtgtatatgt tcttagatta gaataatgca acttcgagta tgctttaata tttcaatatt 12 60 

caagttacaa atgtataagg cagttagaaa taatacagtc acatgtcact taatgatagg 1320 

gaaacattct gagaaatgca ttgtaaggtg actttattgt gtgaacatca tggagtgcac 1380 

ttatacaaac ctagatggga cacctatgac ccacccaggc cagatggtac agcctgttgc 1440 

tcctgggcca cacacctgta cagcatgtga ctgcactgaa taccgcaggc aattgtaaca 1500 

cagtggtgag tatttgtgtt tacaaacata ggaaaggtac agtaaaacta tggtattaca 1560 

atgttatggg accaccgtca tgtaagtggt atgtctttga cagaaacatg gttacgtggt 1620 

tcatgactgt atattcactg gaagatagtc aagactaaag acacattaga gcaaattgac 1680 

ccctttaaca tgtgattatt gtccaattaa agacagttga tttaagtagc aaaaaaaaaa 1740 

aaaaaaa 1747 



<210> 94 
<211> 600 
<212> DNA 

<213> Homo sapiens 

<220> : ■ " ' ' ' 

<221> SITE 
<222> (553) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (560) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (589) 

<223> n equals a,t,g, or c 
<400> 94 

gaattcggca cgagcggcac gagccgagat cgttctgggg ctgctggtat ggacgcttat 60 
tgctggaact gagtacttcc gggtcccegc atttggctgg gtcatgtttg tagctgtatt 120 
ttactgggtc ctcaccgtst tcttcctcat tatctacata acaatgacct acaccaggat 180 
tccccaggtg ccctggacaa cagtgggcct gtgctttaac ggcagtgcct tcgtcttgta 240 
cctctctgcc gctgttgtag atgcatcttc cgtctcccct gagaaggaca gtcacaactt 300 
caacagctgg gcggcctcat cgttctttgc cttcctggtc accatctgct acgctggaaa 360 

tacata tttc agt tttaw a g cat ggagawc caggaccata _cag t ga tt ta_cca t t.t t ga t 4 2.0 

aattaaaagg aaaaaaaaag gaagactctc actgtaaaaa cagctgtagg tataatgtat 480 
attcccagag aattgtattt aactaattaa tgttttttat attcttaaat ttgctcacaa 540 
attgtggttt gtnacaattn aactgggtta ctttatttgg caagtgttnt aggcttttaa 600 



<210> 95 
<211> 586 
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<212> DNA 

<213> Homo sapiens 



<400> 95 

ggcacgaggt tttttccttt ataacggaag ttttataatt catcttttat gtaagtgtaa 60 

ttctcattaa aaatacccta aagcttaaag tttgcaaggc tgcccagcct aacccacaac 12 0 

agtttgatgc tgccccctag cgtttgattc ccttcacctt ttgctaaaat aaggtaatgt 180 

ttaaattaca attagattta cttactgctg taaatctggt ctattttagt ttcctctggg 240 

tagttagtgt tgctaataag atggacgtaa gtgtttttga actggtgaat tctgattgct 300 

tttagccccc agttttccaa ataggggtga attttgggta gagatagaac aatcaccaag 360 

ttaccttgct ccaaaaaaga aatttacgta tgggattgtt ttcaaagcgg gaagttagct 420 

gtgtaaataa caacaatttt atatatttaa tctgggcttc tccttatctt gaatgatata 480 

aaaatctact ttctagatta atttagttcc atataacttt gtattgcttt gactgtactg 540 

ataataaagt ttgaaagtgt taaaaaaaaa aaaaaaaaaa aaaaaa 586 



<210> 96 

<211> 802 

<212> DNA 

<213> Homo sapiens 



<400> 96 

ggcacgagcc ctcctccctg ctcgccccca gattcccctc ccctccctgg tgcttttgtc 60 

tggagggtgt tatgggtttg tgtgtgtatg agcgtgtgtg tgtttttgga tttcagacta 120 

attttctgga gtttctgccc ctgctct'gcg tcaccctcac gtcacttcgc cagcagtagc 180 

agaggcggcg gcggcggcte : ccggaattgg gttggagcag gagcctcgct ggctgcttcg 2 40 

ctcgcgctct acgcgctcag tccccggcgg tagcaggagc ctggacccag gcgccgccgg 3 00 

cgggcgtgag gcgccggagc ccgggtgagc agcgcagata gtgccctcgg tcgcctcggc 3 60 

cctcactgtc tccccctggg gcggcctcgg ctactcccca ggtgggacgt gccgcgccac 420 

ctgcccgcgc caccggcacc cagcggccgt ggcggattct gcagcatcat tcgggggccc 480 

cgtcgcggag ccaaagccgc cggcagtctc cgcattcccc tttaaagggt ccttcgcccg 54 0 

gcctgtacca tggaatcctg tcttggggac cctttcccta cctcccctcc cttggcctca 600 

ggctcgaaga gagagtgggc acactggtgg ctccagcggc gtcagtgcca tcgcggggca 660 

agttgattcc tgggcactca tccatccaca gtctccgggc tggggtcggg gtggggatga 720 
cgcgagcaga gagggagagt gccccaatta gtggtgttgg gggtcctacg ctcagtctta - 780 

cgcgtgtctg tttgtcctca gc 802 



<210> 97 
_ _ <211> 1226- 
<212> DNA 

<213> Homo sapiens - 



<400> 97 

ggcacgagca tgctttgctt acaatggagt ctgcagtgag gggagatgct gggatagcca 60 

tttccatggc tctgttatgc aagcacaaat ttcatctcct agatggactt cctggttttc 120 

tcttactgca gtaacactgg ccttcccttc tctaattcct taccccagct gcggcatccc 180 

tgtgttaact caggatgcca agtggccctc agattacact tctccagata gctga atgag 240 

t c t gc tt t"c a~ct"gtga^t^g^gacc^tgaatg acctgcagtc agggcccaga g 1 1 gggac t c 300 

tatactaccc tgggctctgg tctgtaggtt tgtagtagcc accggtaata agccaagggc 360 

taggctcttg tttgagttta tggccacctg gaattttcag tcatctcatg atacaggcgg 42 0 

gaggggcaga acagatagat tacgacaggt ttggttttta aattttccaa ccaagtggaa 480 

aggcaagttg gtcttataga aagcactact gcacttagta gctatgtgat tttgagcaaa 540 

ccacataatc tctctaggtc cattttccta accacaagat aaagatgtta cattgtcaaa 600 

gcttgccgta gatttggggt gaatgaaaat tattccttgc tttcatcact acctttatag 660 



BNSDOCID: <WO 9947540A1 _L> 



WO 99/47540 



PCT/US99/05804 



56 



ctctcatcac tacctttata gctcatcact gtgccttttt ttctttccta agaaagacat 720 
cacatccctc tcctctcctc ctctgtgctc ctgtccctcc ctccccctag caaggtccag . 780 

gcaaagctgg agatgaagct gaagatccag agtttcctag aacgcaactt aaggatggct 840 

aaggaaaggg aagcctgact gctcggtcag gagggtgcag tatctcttgc tgggaacaca 900 

gccagtttcc acaatgccta gactgtgtat gtctatttgc acaagattgg cttttcctat 960 

tttggagtgg tcagacattt tatttttgtt caagattatc tggcgtttta gacaaatttg 1020 

caaaactgtg cttttattga ctttttgaat aaactttggt attctggagc aaatgtattt 1080 

atttattggt atgtgcaatg acaaacttgg tatttttccc atgtttgaca tttatgttat 1140 

gtttgttaga attttagtgt ttgtctaagt acacacatat atcaacaaat taaacttgaa 1200 

tcgtttcaaa aaaaaaaaaa aaaaaa 1226 



<210> 98 

<211> 1120 

<212> DNA 

<213> Homo sapiens 



<400> 98 , 

aggggactct caccctctcc cagcaatgtc taaagtcagg catctgaaaa ccagcagtaa 60 

tcctgcctct gaagtttatc aggaaaggag cttaaaagag aaccaaattc agcctgtgtt 120 

ggaactctca gtcccagagg ggtgtggttt atagctctcc ggcctgctgt tggacttagg 180 

ctgtgaccca cagaaggacg ccagaaagta ctcaagacat tcacggtgcc ccggtcagca 240 

ctcgccatga cgaagacttc tacatgcata taccacttcc ttgttctgag ctggtatact 3 00 

ttcctcaatt attacatctc acaggaagga aaagacgagg tgaaacccaa aatcttggca 3 60 

aatggtgcaa ggtggaaata tatgacgctg cttaatctgc tcttgcagac cattttctac 420 

ggggtcacct gcctggatga tgtgotgaaa agaaccaaag ggggaaaaga cattaagttc 480 

ctaactgcct tcagagacct gcttttcacc actctggctt ttcctgtatc cacgtttgta 540 

tttttggcat tctggatcct ctttctctac aatcgagatc tcatttaccc caaggtccta 600 

gatactgtca tccccgtgtg gctgaatcat gcaatgcaca ctttcatatt ccccatcaca 660 

ttggctgaag tcgtcctcag gcctcactcc tatccatcaa agaagacagg actcaccttg 720 

stggctgctg ccagcattgc ttacatcagc cgcatcctat ggctctactt tgagacgggt 7 80 

acctgggtgt atcctgtgtt tgccaaactc agcctcttgg gtctagcagc tttcttctct 840 

ctcagctacg tcttcatcgc cagcatctac ctacttggag agaagctcaa ccactggaaa 900 

tggggtgaca tgaggcagcc acggaagaag aggaagtaat tgcacaccat tttccaagaa 960 

ccaagaaaga agaaaacaca agagattttt ctcatctttt tttttttttt tctggtggag 1020 

ggaggtggtg gaggaacata gcaaagtagg agggacagag agtgatactt aaatttaata 1080 

agaggttcgt gaaggtaaaa aaaaaaaaaa aaaactcgag 112 0 



-<210>-99 
<211> 2596 
<212> DNA 
<213> Homo sapiens 



<400> 99 

ccacgcgtcc gacttggcaa gcgttcacaa ccaaaatggc cagctctttc tggaagatat 60 

tgtaaaacgt gatggatttc cactatgggt tgggctctca agtcatgatg gaagtgaatc 120 

aagttttgaa tggtctgatg gtagtacat-t tgactatatc cc atggaaag gccaaaca tc 180 

tec t ggaaa t"tg tg t: t^c t c t "tggatccaaa aggaact tgg~ aaacatgaaa aa t gcaac t c 240 

tgttaaggat ggtgctattt gttataaacc tacaaaatct aaaaagctgt cccgtcttac 3 00 

atattcatca agatgtccag cagcaaaaga gaatgggtca cggtggatcc agtacaaggg 360 

tcactgttac aagtctgatc aggcattgea cagtttttca gaggecaaaa aattgtgttc 420 

aaaacatgat cactctgcaa etategttte cataaaagat gaagatgaga ataaatttgt 480 

gagcagactg atgagggaaa ataataacat taccatgaga gtttggcttg gattatctca 540 

acattctgtt gaccagtctt ggagttggtt agatggatca gaagtgacat ttgtcaaatg 600 
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ggaaaataaa agtaagagtg gtgttggaag atgtagcatg ttgatagctt caaatgaaac 660 

ttggaaaaaa gttgaatgtg aacatggttt tggaagagtt gtctgcaaag tgcctctggg 720 

ccctgattac acagcaatag ctatcatagt tgccacacta agtatcttag ttctcatggg 780 

cggactgatt tggttcctct tccaaaggca ccgtttgcac ctggcgggtt tctcatcagt 840 

tcgatatgca caaggagtga atgaagatga gattatgctt ccttctttcc atgactaaat 900 

tcttctaaaa gttttctaat ttgcactaat gtgttatgag aaattagtca cttaaaatgt 960 

cccagtgtca gtatttactc tgctccaaag tagaactctt aaatactttt tcagttgttt 1020 

agatcttagg catgtgctgg tatccacagt taattccctg ctaaatgcca tgtttatcac 1080 

cctaattaat agaatggagg ggactccaaa gctggaactg aagtccaaat tgtttgtaca 1140 

gtaatatgtt taatgttcat tttctctgta tgaatgtgat tggtaactag atatgtatat 1200 

tttaatagaa tttttaacaa aacttcttag aaaattaaaa taggcatatt actaggtgac 1260 

atgtctactt tttaattttt aagagcatcc ggccaaatgc aaaattagta cctcaaagta 1320 

aaaattgaac tgtaaactct atcagcattg tttcaaaata gtcattttta gcactgggga 1380 

aaaataaaca ataagacatg cttacttttt aatttttatt tttttgagac tgagtctctc 1440 

tctgttgccc aggctggagt acaatggcgt gatctcggct cactgcaaat ctccgcctcc 1500 

caggttcaag cgattctcct gcctcagcct cctgagtagc tgggattaca ggcaactgcc 1560 

accatgcccg gctaattttt gtatttttag tagagatggg gtttcaccat gttggccagg 1620 

ctggtctcga actcgtgacc gcaggtgatc ctcccgcctc ggcctcccaa agtgctggga 1680 

ttacaggcat gagccaccgc gcctggcctc tgcttacttt ttatatagca aaatgattcc 1740 

tcttggcaag atgtttctta tattattcca aagttatttc ataccattat tatgtaaata 1800 

tgaagagttt ttttctgttt ataattgttt ataaaacaat gacttttaaa gatttagtgc 1860 

ttaacatttt cccaagtgtg ggaacattat ttttagattg agtaggtacc ttgtagcagt 1920 

gtgctttgca ttttctgatg tattacatga ctgtttcttt tgtaaagaga atcaactagg 1980 

tatttaagac .tgataatttt acaatttata tgcttcacat agcatgtcaa cttttgacta 2040 

agaattttgt ttactttttt aacatgtgtt aaacagagaa agggtccatg aaggaaagtg 2100 

tatgagttgc atttgaaaaa : tgagactttt tcagtggaac tctaaacctt gtgatgacta 2160 

ctaacaaatg taaaattatg agtgattaag aaaacattgc tttgtggtta tcactttaag 2220 

ttttgacacc tagattatag tcttagtaat agcatccact ggaaaaggtg aaaatgtttt 2280 

attcagcatt taacttacat ttgtacttta gagtattttt gtataaaatc catagattta 2340 

ttttacattt agagtattta cactatgata aagttgtaaa- taattttcta agacagtttt 2400 

tatatagtct acagttgtcc tgatttctta ttgaatttgt tagactagtt ctcttgtctt 2460 

gtgatctgtg tacaatttta gtcactaaga ctttcctcca agaactaagc caacttgatg 2520 

tgaaaagcac agctgtatat aatggtgatg tcataataaa gttgttttat cttttaagta 2580 

aaaaaaaaaa aaaaaa 2596 



<210> 100 
<211> 1020 
<212> DNA 

<213>Homo sapiens 



<400> 100 

aaactagggg aaaatgtagc caacatatac aaagatcttc agaaactctc tcgcctcttt 60 

aaagaccagc tggtgtatcc tcttctggct tttacccgac aagcactgaa cctaccagat 12 0 

gtatttgggt tggtcgtcct cccattggaa ctgaaactac ggatcttccg acttctggat 180 

gttcgttccg tcttgtcttt gtctgcggtt tgtcgtgacc tctttactgc ttcaaatgac 2 40 

ccactcctgt ggaggttttt atatctgcgt gattttcgag acaatactgt cagagttcaa 300 

gacacagatt ggaagactgt a caggaagag gc a cataca a agaa a agaat c cccgaaagg 360 

gcggtttgtg atgctcctgc catcgtcaac tcacaccatt ccattctatc ccaacccctt 420 

gcaccctagg ccatttccta gctcccgcct tcctccagga attatcgggg gtgaatatga 480 

ccaaagacca acacttccct atgttggaga cccaatcagt tcactcattc ctggtcctgg 540 

ggagacgccc agccagtttc ctccactgag accacgcttt gatccagttg gcccacttcc 600 

aggacctaac cccatcttgc cagggcgagg cggccccaat gacagatttc cctttagacc 660 

cagcaggggt cggccaactg atggccggct gtcattcatg tgattgattt gtaatttcat 720 

ttctggagct ccatttgttt ttgtttctaa actacagatg tcaactcctt ggggtgctga 780 
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tctcgagtgt tattttctga ttgtggtgtt gagagttgca ctcccagaaa ccttttaaga 840 

gatacattta tagccctagg ggtggtatga cccaaaggtt cctctgtgac aaggttggcc 900 

ttgggaatag ttggctgcca atctccctgc tcttggttct cctctagatt gaagtttgtt 960 

ttctgatgct gttcttacca gattaaaaaa aagtgtaaat taaaaaaaaa aaaaaaaaaa 1020 



<210> 101 

<211> 1520 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (71) 

<223> n equals a,t,g, or c 
<220> 

<221>. SITE 
<222> (473) 

<22 3> n equals a,t,g, or c 



<400> 101 

gcttttttct taagtgcaca aagcatcata ctccctggag gcaaacacat cgggctgctt 60 

cagcgttacg ngatgcttag cattttgaat attgtggcaa aaaaattaaa agttcactta 120 

ttaatattta tcagcagtat cataatttcc -atcctcttat ttcagaattt cacttgaggc 18,0 

aaaaatacca caagtgtaat tactcfeagca cag'ct'attaa tgtgctggat gataggccac 240 

tgcgtcacat gaccttctat tgttcatggg tttaaagaga aagcagggct ttgtatttct 300 

ttttcttctt ttaaagtcga ctgtagcatc ttggcttttg tctggggtgg ggaggatctg 360 

gggtctggtt cactttgtaa aagtaaacca tgtctgttta aacaatagag gtgtttaaga 42 0 

agactcttta gttttcctgc agattgttca agattacatg ataatcacac ggngtattta 480 

tttcctactg acaaaccaag tacttgttac atcaccaatg gtaccaggag atgaagacgc 540 

gggttttgag caggagcgag- attaccaccc aaaaagggag ctacctgagg cagcccagct 600 

tctagcaaac tttttacatg ttgcacattt cagttcttaa atgaaggcta ctccagtgtc 660 

atttcattaa agtacctggg tgtagtactc aagtcccccc tcaagagttc ataagtaagc 720 

agtatccttt tggccagtgg tcctgttttt gcccctac'cc agactgttcg rgaagcatat 780 

tctatagata aatctgacat ttgtcatcca ataccattgc agtcctctgc agcatacatt 840 

ctcaatgggg gctgtatcac ccctagattg gttctgagat actgcaatgt cttgtgtcct 900 

tccaaaggac cataatactt gagcaaatat gaacatttct tggggtgagg gcagaaagag 960 

agaaacaaaa gtctaaaaag ggacaataat gaaaaaacag ttgagacctt tagtatgatg 1020 

ggaacaggat gaggaaggag gagatactga~caggagccct gggtcttgct ctgcattaaa 1080" 

cagatattta tggacattaa acagatattt atggagcacg actctgtacc ctacaggccc 1140 

agaatagtct taaggctcct gggaattgat gataggccat ttacccagtt tcagtttaga 1200 

ggcagattca ctggccttag catttcagta attatattta tttattttta gcctgaacca 1260 

gatttaatag gagaaactac tttctgcgtt tcttttaatt acttgtagtt tacacagtaa 1320 

ctttagaaga gtaaatgaaa gcatgcttcg atgctgccac tgtaaatacc attcattagt 13 80 

aacttatttt ccctggagtc ttgtgaagtg tgaatttaaa gcctgctcta tctggaatat 1440. 

ggaatagtat taagattaca agcacatttt atattcatga gccggaaagg caaaaaaaaa 1500 

aaaaaaaaat gaccctcgag • 1520 



<210> 102 
<211> 1306 
<212> DNA 

<213> Homo sapiens 
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<220> 

<221> SITE 
<222> (1300) 

<223> n equals a,t,g, or c 



<400> 102 

aattcccggg tcgacccacg cgtccggaat ttaagggacc cacactacct tcccgaagtt 60 

gaaggcaagc ggtgattgtt tgtagacggc gctttgtcat gggacctgtg cggttgggaa 12 0 

tattgctttt cctttttttg gccgtgcacg aggcttgggc tgggatgttg aaggaggagg 180 

acgatgacac agaacgcttg cccagcaaat gcgaagtgtg taagctgctg agcacagagc 240 

tacaggcgga actgagtcgc accggtcgat ctcgagaggt gctggagctg gggcaggtgc 300 

tggatacagg caagaggaag agacacgtgc cttacagcgt ttcagagaca aggctggaag 360 

aggccttaga gaatttatgt gagcggatcc tggactatag tgttcacgct gagcgcaagg 420 

gctcactgag atatgccaag ggtcagagtc agaccatggc aacactgaaa ggcctagtgc 480 

agaagggggt gaaggtggat ctggggatcc ctctggagct ttgggatgag cccagcgtgg 540 

aggtcacata cctcaagaag cagtgtgaga ccatgttgga rgargaggar gaagaggagg 600 

aagaggaagg gggagacaag atgaccaaga caggaagcca ccccaaactt gaccgagaag 660 

atctttgacc cttgcctttg agcccccagg aggggaaggg atcatggaga gccctctaaa 72 0 

gcctgcactc tccctgctcc acagctttca gggtgtgttt atgagtgact ccacccaagc 780 

ttgtagctgt tctctcccat ctaacctcag gcaagatcct ggtgaaacag catgacatgg 840 

cttctggggt ggagggtggg ggtggaggtc ctgctcctag agatgaactc tatccagccc 900 

cttaattggc aggtgtatgt gctgacagta ctgaaagctt tcctctttaa ctgatcccac 960 

ccccacccaa aagtcagcag tggcactgga gctgtgggct ttggggaagt cacttagctc 1020 

cttaaggtct gtttttagac ccttccaagg aagaggccag aacggacatt ctctgcgatc 1080 

tatatacatt gcctgtatcc aggaggctac acaccagcaa accgtigaagg agaatgggac 1140- 

actgggtcat ggcctggagt ^tgctgataat ttaggtggga tagatacttg gtctacttaa 1200 

gctcaatgta acccagagcc caccatatag ttttataggt gctcaatttt ctatatcgct 1260 

attaaacttt tttctttttt tctaaaaaaa aaaaaaaaan actcga 1306 



<210> 103 
<211> 785 
<212> DNA 
<213> Homo sapiens 



<400> 103 

cttttagaag gtacgcctgc aggtaccggt ccggaattcc cgggtcgacc cacgcgtccg 60 

ggaaatgaac taccatttat aacttctgtt tttttattga gaaaatgatt cacgaattcc 120 

aaatcagatt gccaggaaga aataggacgt gacggtactg ggccctgtga ttctcccagc 180 

TcWcWtT^gct'aggfga gaggaaa^^tctttacttc cgcccctggc" "agggacttc t J 240 

gggttatggg agaaaccaga gatgggaatg aggaaaatat gaactacagc agaagcccct 300 

gggcagctgt gatggagccc ctgacattac tcttcttgca tctgtcctgc cttctttccc 360 

tctgcgaggc agtggggtgg gattcagagt gcttagtctg ctcactggga gaagaagagt 420 

tcctgcgcat gcaagccctg ctgtgtggct gtcgtttaca tttgggaggt gtcctgtatg 480 

tctgtacgtt ggggactgcc tgtatttgga agatttaaaa acctagcatc ctgttctcac 540 

cctctaagct gcattgagaa atgactcgtc tctgtatttg tattaagcct taacactttt 600 

cttaagtgca ttcggtgcca acatttttta gagctgtacc aaaacaaaaa gcctgtactc 660 

acatcacaat gtcattttga taggagcgfct ttgttatttt tacaa ggcag aat ggggtcrt 720 

aacagttgaa ttaaacttag caatcacgtg ctcaaaaaaa aaaaaaaaaa aaaaagggcg 780 

gccgc 785 



<210> 104 
<211> 2015 
<212> DNA 
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<213> Homo sapiens 
<220> 

<221> SITE 
<222> (3) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (9) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1981) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1990) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (2001) 

<223> n equals a,t7g, or c- 
<220> 

<221> SITE 
<222> (2002) 

<223> n equals a,t,g, or c 



<400> 104 

ccnggaatnc cgggtcgacc cacgcgtccg gcctgcgctg ccagcagcca ggagccagga 60 

gccaagagca gagcgccagc atgaacttgg gggtcagcat gctgaggatc ctcttcctcc 120 

tggatgtagg aggagctcaa gtgctggcaa caggcaagac ccctggggct gaaattgatt 180 

tcaagtacgc cctcatcggg actgctgtgg gtgtcgccat atctgctggc ttcctggccc 240 

tgaagatctg catgatcagg aggcacttat ttgacgacga ctcttccgac ctgaaaagca 300 

crcctggggg cctcagtgac accatcccgc taaagaagag agccccaagg cgaaaccaca 360 

atttctccaa aagagatgca caggtgattg agctgtaggt gagcagtgac gtgaagaggg 420 

gttctagccc cgtggaaaac agcccatggt taacatctca ggatgtcctg cattcaaaca 480 

cccaaggctg gtaatgaact ttcacatgga ctgaatattg gaggcaaata atagaaggaa 540 

tagaatatac agtgcctctg tcctgaagga aaatatcatg cctcttctgg aagaaacgga 600 

ctgcacagag gaaggattga gcaatttagc ctgcagtgga agaaggtgga caccaaaagc 660 

ttcaccctgt gttggagctg ttcatgcttc catgaggcca tggtgtccat gtccgtggaa 720 

cctaccacag aaaatggctc atgaaaaggg gaatccgacc caacacacag cttcctacac 780 

actgccatct tatcaacagt taggcactac tttgtagaac gattagcttc accctcttag 840 

cjtcrccaggag atcccttctt aaaqatgga c ta tqtgaaqa tLt cgggag.t c_c t gaaaca tg 900 

gggactccgg gatggtctct agccctatcg atgatgaaca ctggccttct ggaggggaaa 960 

tggcagtctg ggctggcgtg gtaggaaggg ctttggtgtt catggaatgg gcctgctgct 1020 

ctcagacctt caaaggatgg aaccaacgaa ggaccaaatg agaaagcaga tgcttgcctt 1080 

gcagagggcc atgaatgtca gttattattt ttctccttat acaattattt tgtggttatt 1140 

attacaatgt acatggctgt tgcatagaag acatgactgg tggaggctga ggaaagccat 1200 

gacattctac aattgccatc aggctaaggc cccgtgagca tttctctccc ttgtaatatt 1260 

aaccctgtat ttctgggatc acatcacgga atattctttg cctttccact ttccaggaaa 1320 
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tctctcggac tgggctaccc tccttgtgtg tgatgaaaga tgagctatat ttcagaacaa 13 80 

agtgctgtgt tgtcatratt tgcctggact cccagggcgt ctcttaccca acttgataac 1440 

gatgctgttc attagcagcc tttgttaact gataaccaag agcggtaatg tgatactcat 1500 

aagcaatttt ctgtgtgtag gataaaataa accatcttgt atgggaaaaa aaaaaaaaaa 1560 

aaaaaaaaaa aaaaagggcg gccgctctag aggatccaag cttacgtacg cgtgcatgcg 1620 

acgtcatagc tcttctatag tgkcacctaa attcaattca ctggccgtcg ttttacaacg 1680 

tcgtgactgg gaaaaccctg gcgttaccca acttaatcgc cttgcagcac atcccccttt 1740 

cgccagctgg cgtaatagcg aagaggcccg caccgatcgc ccttcccaac agttgcgcag 1800 

cctgaatggc gaatgggacg cgccctgtag cggcgcatta agcgcggcgg gtgtggtggt 1860 

tacgcgcagc gtgaccgcta cacttgccag cgccctagcg cccgctcctt tcgctttctt 192 0 

cccttccttt ctcgccacgt tcgccgggtt tccccgtcaa gctttaaatc gggggcttcc 19 80 

nttaagggtn ccaattaagg nnttaccggg acctt 2015 



<210> 105 
<211> 367 
<212> DNA 

<213> Homo sapiens 



<400> 105 

cggcacgagt gtaaatgtca ccaccaaagg tttgcaccct gatcaaaaag agtatgaaaa 60 

gaataatacc acaacactta tggcctgtct tggaggcctt ctggggatta ttggtgtgat 120 

atgtcttatc agctgcctct ctccagaaat gaactgtgat ggtggacaca gctatgtgag 180 

gaattactta cagaaaccaa cctttgcatt aggtgagctt tatcctcctc tgataaatct 240 

ctgggaagca ggaaaagaaa aaagtacatc actgaaagta. aaagcaactg ttataggttt 300 

accaacaaat atgtcctaaa aaccaccaag gaaacctact ccaaaaatga aaaaaaaaaa 3 60 

aaaaaaa 3 67 



<210> 106 
<211> 1889 
<212> DNA 

<213> Homo sapiens 
<400> 106 

ctcatccttc tatcatcata tggagtggca ataatgaaaa 
attggtatca tatcagtttc actgaccggc caatctacat 
atgtgaaaaa catcagagag ctcgtactgg caggagacaa 
ccagtcctac aaatggggct gaaactgttg cagaagcctg 
gcaattat tt tggtgatgta catttttatg-actatatcag 
ttttcccaaa agctcgattt gcatctgaat atggatatca 
cattagaaaa ggtctcgtct acagaggact ggtctttcaa 
gacaacatca cgaaggtggt aacaaacaaa tgctttatca 
tcccccaaag cacagatcca ttacgcacat ttaaagatac 
tgcaggccca gtgtgtcaaa acagaaactg aattctaccg 
tggatcagca agggcacacg atgggggcac tttattggca 
ctccttcctg ggcttctctt gatacggagg aaagtggaaa 
gaatttcttt gctccactgt tgccagtagc tttgaaatga 
" gtgtgtcaga tcttcactcg gattattcga tgacactcag 
gctccctgga gcccgtgtgc tctcgtgtga ctgaacgttt 
ctgtctgcct ttatgaggag ccagtgtctg aattgctgag 
gggaaagctg tgtggtttcc ttttaccttt cagctgacca 
actaccactt cctgtcctca ccgaaggagg ccgtggggct 
ccatcatctc tcagcaaggt gacatatttg tttttgacct 
cctttgtttg gttggatgta ggaagcatcc cagggagatt 



tgaggaggcg ctgatgatga 60 
caaggactat gtgacactct 12 0 

gagtcgtcct tttattacgt 180 
ggtctctcaa aaccctaata 240 
tga t tgc tgg~ aactggaaag 3 0 0" ~ 

gtcctggccg tccttcagta 360 
tagcaagttt tcacttcatc 42 0 

ggctggactt catttcaaac 480 
catctacctt actcaggtga 540 
ccgtagtcgc agcgagatag 600 
gttgaatgac atctggcaag 660 
atgcttcatt actttgctca 720 

aaa c atgttc ta ta tct at g 78 0 

tgtgagagtc catacatgga 840 
tgtgatgaaa ggaggagagg 900 
gagatgtggg aattgcacac 960 

tgaactcctg agcccgracca 1020 

ctgcaaggcg cagatcactg 1080 

ggagacctca gctgtcgctc 1140 

tagtgacaat ggtttcctca 1200 
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tgactgagaa gacacgaact atattatttt acccttggga gcccaccagc aagaatgagt 1260 

tggagcaatc ttttcatgtg acctccttaa cagatattta ctgaaggaat ctaggttgta 1320 

ttttcagtgg acaatgggaa taaagcattt ctaaagcacc gactggagag gaaggcaaca 1380 

gagacaagga gagaagccga gagacatgtc tgcgtgctgc cacgcatctg agcgattgct 1440 

ctgtgaagag ttgtacactg aacattttca ggggaggctg tttacccagg caatgtcctc 1500 

aaacaagcct gtgccggggt . gtcctggaat ctgtgccagg actgtgtttt tagcccttca 1560 

cctctcagct ttagcaggac atgaaccagt tataacaaga tggccctgca gctggttaca 162 0 

agaatgtgac atggcaggat ctatggaacc aaatggaagg ttttgaggtg atgtaggtct 1680 

ttcacagtta gctttgggga atacagaata ctcaaataaa gtgcttt'gtt attatttcag 1740 

agggaatggc gattgaaatg ttacaacaga gatttcttgg tggtagctat ttgggtaaag 1800 

gtatatggat atttttctgt acatgtgaaa ttatataaaa ataaaagtta tataaattac 1860 

attgaaaaaa aaaaaaaaaa aaaaaaaaa 1889 



<210> 107 

<211> 1201 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (1086) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1161) 

<223> n equals a,t,g. or c 
<220> 

<221> SITE 
<222> (1176) 

<223> n equals a,t,g, or c 



<400> 107 

cggcacgagc ggctggcagc acgactcgcg taccgtgcgc cgattgcctc tcggcctggg 60 

caatggtccc ggctgccggt cgacgaccgc cccgcgtcat gcggctcctc ggctggtggc 120 

aagtattgct gtgggtgctg ggacttcccg tccgcggcgt ggagggacct tatggatttt 180 

ctgaacccaa acggtagtga ctgtactcta gtcctgtttt acaccccgtg gtgccgcttt 240 

tctgccagtt tggcccctca ctttaactct. ctgccccggg cat ttccagc tcttcacttty — - -300 * ~ 

ttggcactgg atgcatctca gcacagcagc ctttctacca ggtttggcac cgtagctgtt 3 60 

cctaatattt tattatttca aggagctaaa ccaatggcca gatttaatca tacagatcga 42 0 

acactggaaa cactgaaaat cttcattttt aatcagacag gtatagaagc caagaagaat 480 

gtggtggtaa ctcaagccga ccaaataggc cctcttccca gcactttgat aaaaagtgtg 540 ' 

gactggttgc ttgtattttc cttattcttt ttaattagtt ttattatgta tgctaccatt 600 

cgaactgaga gtattcggtg gctaattcca ggacaagagc aggaacatgt ggagtagtga 660 

tggtctgaaa gaagttggaa agaggaactt caatccttcg tttcagaaat tagtgctaca 720 

gtttcataca ttttctccag tgacgtgt-tg acttgaaact tcaggcagat taaaagaatc 780 

atttgltgaa-caaetgaatg-feataaaaaaa- ttataaactg gt _ getTtta"a^ ta^Fa^tgca 8T0 

ataagcaaat gcaaaaatat tcaatagatg cactattctt gtttttactg catgmacgta 900 

atccagtatt tggkaaagta atccaktttg aaatgtgrag rtgtattccg gcagaatagt 960 

gagtagaatg acagcttact atacagaagg cmaaaatagg actctcaggt aatagtttaa 102 0 

ggaaaccctt gattccttat gatgatgttt aagaaaggtt agttttctgt ttctttgcca 1080 

gttttncttc taggagtcca tagccaggga aagtatgtga accagaattg gttagtgtga 1140 

ccccctccaa gtagccagtg ntgggaaata agggtncaat accttgatgt ttgtgatctc 12 00 
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<210> 108 

<211> 75 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (75) - 

<223> Xaa equals stop translation 
<400> 108 

Met Asp Pro Leu Cys Leu Pro lie lie Leu Phe Ser Ala Val Val Leu 
1 ' 5 10 15 

Arg Asn Leu Phe His Leu Leu lie Leu Thr Phe His Tyr Leu Pro Leu 
20 25 30 

Phe Cys Asp Asn Pro Leu lie Leu Glu Asp Leu Ser Cys lie His Leu 
35 40 45 

Arg Val Asn lie Phe Lys Ala Lys Gin Pro Lys Phe Tyr Gly Asn Gin 

50 . .55 ' " _ , GO 

Leu Gin Pro Cys Val Met Lys Ser Ser Ala Xaa 
65 70 75 



<210> 109 

<211> 202 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (202) 

<223> Xaa equals stop translation 
<400> 109 

Met Lys Leu Leu lie Leu Phe Leu Ser His Leu Leu Ser Leu Ala Phe 
1 5 10 15 

Gly lie Leu Cys Leu Ser Val Thr Val lie Leu Ser Leu Leu Leu Ser 
20 25 30 

Phe Ser Lys Arg Gly Phe Ser Val -Arg Ser Phe Gly Thr Gly Thr His 

35 : 40 45 



Val Lys Leu Pro Gly Pro Ala Pro Asp Lys Pro Asn Val Tyr Asp Phe 
50 55 60 



Lys Thr Thr Tyr Asp Gin Met Tyr Asn Asp Leu Leu Arg Lys Asp Lys 
65 70 75 80 
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Glu Leu Tyr Thr 



Arg lie Lys Pro 
100 

Asp Leu lie Leu 
115 

Asp Leu Asn Ser 
130 

Asn Val Asp lie 
145 

Leu lie Cys Glu 



Asn Glu lie Asp 
180 

Thr Phe Leu His 
195 



Gin Asn Gly lie 
85 

Arg Pro Glu Arg 



Thr Cys Glu Glu 
120 

Arg Glu Gin Glu 
135 

Gin Asp Asn His 
150 

Leu Cys Gin Cys 
165 

Glu Leu Leu Gin 



Thr Val Cys Phe 
200 



Leu His Met Leu 
90 

Phe Gin Asn Cys 
105 

Arg Val Tyr Asp 



Thr Cys Gin Pro 
140 

Glu Glu Ala Thr 
155 

lie Gin His Thr 
170 

Glu Phe Glu Glu 
185 

Tyr Xaa 



Asp Arg Asn Lys 
95 

Lys Asp Leu Phe 
110 

Gin Val Val Glu 
125 

Val His Val Val 



Leu Gly Ala Phe 
160 

Glu Asp Met Glu 
175 

Lys Ser Gly Arg 
190 



no 

371 
PRT 

Homo sapiens 
<220> 

<221> SITE 
<222> (31) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (193) 

<223> Xaa equals any. of __the naturally- occurring L-amino acids 
<400> 110 

Met- Gly Leu Lys Leu Leu Gin Lys Pro Gly Ser Leu Lys Thr Leu lie 
15 10 15 

Ala lie lie Leu Val Met Tyr lie Phe Met Thr lie Ser Val Xaa Cys 
20 25 30 

Trp_Asn- -Trp-Lys— Va-1— Phe— Pro-Lys^ Ala— Arg~Phe~~AIa~Ser~ Glu - Tyr~Gly 

35 40 45 

Tyr Gin Ser Trp Pro Ser Phe Ser Thr Leu Glu Lys Val Ser Ser Thr 
50 55 60 

Glu Asp Trp Ser Phe Asn Ser Lys Phe Ser Leu His Arg Gin His His 



<210> 
<211> 
<212> 
<213> 
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65 70 75 80 

Glu Gly Gly Asn Lys Gin Met Leu Tyr Gin Ala Gly Leu His Phe Lys 
85 90 95 

Leu Pro Gin Ser Thr Asp Pro Leu Arg Thr Phe Lys Asp Thr lie Tyr 
100 105 110 

Leu Thr Gin Val Met Gin Ala Gin Cys Val Lys Thr Glu Thr Glu Phe 
115 120 125 

Tyr Arg Arg Ser Arg Ser Glu lie Val Asp Gin Gin Gly His Thr Met 
130 135 • 140 

Gly Ala Leu Tyr Trp Gin Leu Asn Asp lie Trp Gin Ala Pro Ser Trp 
145 150 155 160 

Ala Ser Leu Glu Tyr Gly Gly Lys Trp Lys Met Leu His Tyr Phe Ala 
165 170 175 

Gin Asn Phe Phe Ala Pro Leu Leu Pro Val Gly Phe Glu Asn Glu Asn 
180 185 190 

Xaa Phe Tyr lie Tyr Gly Val Ser Asp Leu His Ser Asp Tyr Ser Met 

195 200.. -• _. . "" . 20.5. 

Thr Leu Ser Val Arg Val His Thr Trp Ser Ser Leu Glu Pro Val Cys 
210 215 220 

Ser Arg Val Thr Glu Arg Phe Val Met Lys Gly Gly Glu Ala Val Cys 
225 230 235 240 

Leu Tyr Glu Glu Pro Val Ser Glu Leu Leu Arg Arg Cys Gly Asn Cys 
245 250 255 

Thr Arg Glu Ser Cys Val Val Ser Phe Tyr Leu Ser Ala Asp His Glu 
260 265 270 

Leu Leu Ser Pro Thr Asn Tyr His Phe Leu Ser Ser Pro Lys Glu Ala 

275 ... - - 280 - . - — - 285 " " 

Val Gly Leu Cys Lys Ala Gin lie Thr Ala lie lie Ser Gin Gin Gly 
290 295 300 

Asp lie Phe Val Phe Asp Leu Glu Thr Ser Ala Val Ala Pro Phe Val 
305 310 315 320 

Trp Leu Asp Val Gly Ser lie Pro- Gly Arg Phe Ser Asp Asn Gly_Phe 



325 330 ~ 335 

Leu Met Thr Glu Lys Thr Arg Thr lie Leu Phe Tyr Pro Trp Glu Pro 
340 345 350 

Thr Ser Lys Asn Glu Leu Glu Gin Ser Phe His Val Thr Ser Leu Thr 
355 360 365 
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Asp lie Tyr 
370 



<210> 111 
<211> 114 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (38) - 

<22 3> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (114) 

<223> Xaa equals stop translation 
<400> 111 

Met Arg Pro Leu Leu Leu Gly Gly Tyr Trp Val Leu Cys Leu Ser Val 
• 1 5 10 15 

Leu Gly His Ala Ala Leu Tyr His Phe Trp Leu Arg Giu Glu Gly Lys 

20 - • " 25 ' '30 

Gly Pro Pro Gin Val Xaa Ser Val Leu Ala Leu Ala Leu Pro Ala Gly 
35 40 45 

Ser Cys Ala Pro Gly Leu Pro Phe Pro Gly Pro Leu He Pro Thr Gin 
50 55 60 

Leu Leu Phe Ala Leu Glu Trp Gly Thr Pro Thr Pro Leu Arg Asp His 
65 70 75 80 

Pro Pro His Ser Met His Ser Ala Pro Gin Asn Pro Pro Val Phe Leu 
85 90 95 

Gly Thr His Thr Cys' Pro Pro "Ser" Trp Tyr "Phe Arg Leu "lie Pro Gin 
100 .105 110 

Ala Xaa 



<210> 112 

<211> 152 ^_ 

<2r2>~PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (152) 

<223> Xaa equals stop translation 
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<400> 112 

Met . Arg Arg Leu Leu Leu Val Thr Ser Leu Val Val Val Leu Leu Trp 
15 10 15 

Glu Ala Gly Ala Val Pro Ala Pro Lys Val Pro lie Lys Met Gin Val 
20 25 30 

Lys His Trp Pro Ser Glu Gin Asp Pro Glu Lys Ala Trp Gly Ala Arg 
35 40 45 

Val Val Glu Pro Pro Glu Lys Asp Asp Gin Leu Val Val Leu Phe Pro 
50 55 60 

Val Gin Lys Pro Lys Leu Leu Thr Thr Glu Glu Lys Pro Arg Gly Gin 
65 70 75 80 

Gly Arg Gly Pro lie Leu Pro Gly Thr Lys Ala Trp Met Glu Thr Glu 
85 90 95 

Asp Thr Leu Gly Arg Val Leu Ser Pro Glu Pro Asp His Asp Ser Leu 
100 105 110 

Tyr His Pro Pro Pro Glu Glu Asp Gin Gly Glu Glu Arg Pro Arg Leu 
115 _ . . 120. _ "" 125 

Trp Val Met Pro Asn His Gin Val Leu Leu Gly Pro Glu Glu Asp Gin 
130 135 140 

Asp His lie Tyr His Pro Gin Xaa 
145 150 



<210> 113 
<211> 56 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE - --- -- — ■— — 

<222> (56) 

<223> Xaa equals stop translation 
<400> 113 

Met Pro Cys Gly Lys Phe Leu Phe Pro Val Ser Pro Val Ser Ser Leu 
1-5 10 15 

Ser Leu His Trp Ser Ala Val Leu- Leu Leu Leu Leu Ala Asp Phe Pro 



20 25" 30 

Arg Val His Gly Ser Pro Pro Gly Val Ser Arg Val Ser lie Leu His 
35 40 45 

Cys Leu Phe Pro Phe Leu Ser Xaa 
50 55 
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<210> 114 
<211> 237 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 

<222> (237) 

<223> Xaa equals stop translation 

<400> 114 

Met Glu Val Arg Leu He Phe Leu Ser Gly Leu Cys He Ala Val Ala 
1 .5 10 15 

Val Val Trp Ala Val Phe Arg Asn Glu Asp Arg Trp Ala Trp He Leu 
20 25 30 

Gin Asp He Leu Gly He Ala Phe Cys Leu Asn Leu He Lys Thr Leu 
35 40 45 

Lys Leu Pro Asn Phe Lys Ser Cys Val He Leu Leu Gly Leu Leu Leu 
50 55 60 

Leu Tyr Asp Val Phe Phe Val Phe lie Thr Pro Phe lie Thr Lys Asn 
65 70 75 80 

Gly Glu Ser He Met Val Glu Leu Ala Ala Gly Pro Phe Gly Asn Asn 
85 90 95 

Glu Lys Leu Pro Val Val He Arg Val Pro Lys Leu He Tyr Phe Ser 
100 105. 110 

Val Met Ser Val Cys Leu Met Pro Val Ser He Leu Gly Phe Gly Asp 
115 120 125 

He He Val Pro Gly Leu Leu He Ala Tyr Cys Arg Arg Phe Asp Val 
130 135 _140 

Gin Thr Gly Ser Ser Tyr He Tyr Tyr Val Ser Ser Thr Val Ala Tyr 
145 150 155 160 

Ala He Gly Met He Leu Thr Phe Val Val Leu Val Leu Met Lys Lys 
165 170 175 

Gly Gin Pro Ala Leu Leu Tyr Leu Val Pro Cys Thr Leu He Thr Ala 
180 "185 19.0 ^: 



Ser Val Val Ala Trp Arg Arg Lys Glu Met Lys Lys Phe Trp Lys Gly 
195 200 205 



Asn Ser Tyr Gin Met Met Asp His Leu Asp Cys Ala Thr Asn Glu Glu 
210 215 220 
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Asn Pro Val lie Ser Gly Glu Gin lie Val Gin Gin Xaa 
225 230 235 



<210> 115 

<211> 44 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (44) 

<223> Xaa equals stop translation 
<400> 115 

Met Phe Cys Phe Tyr Leu His Phe lie Phe His Val Leu Ser Tyr Lys 
15 10 15 

Leu Asn Pro Leu Leu Phe Phe Ser Cys Ser Cys Phe Cys Phe lie Leu 
20 .25 30 

Val Phe Leu Phe Pro Asp Tyr His Leu Gly Met Xaa 
35 40 



<210> 116 .... 

<211> 65 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (65) 

<223> Xaa equals stop translation 
<400> 116 

Met Val Arg His lie Arg Glu Arg Arg Arg Gin Pro Leu Ala Phe Gin 
1. 5' 10 15 



Arg Val- Leu Leu Ser-Leu^Cys Leu ~ Leu "Glu Gly lie Trp "His Ser Pro 
20 25 30 

Ala Ala Ala Ala Gly Gly Gly Ser His Cys Ser Ser Trp Pro Ser Leu 
35 40 45 

Tyr Thr Thr Phe Gin Arg Val Ser Leu Leu Glu Leu Asp Leu Gly Leu 
50 55 60 

"Xaa — 
65 



<210> 117 
<211> 118 
<212> PRT 
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<213> Homo sapiens 



<220> 

<221> SITE 

<222> (118) 

<223> Xaa equals stop translation 

<400> 117 

Met Ala Arg Ser Ala Leu Arg Leu Glu lie Leu Gly Gin Leu Leu Val 
1 5 . 10 15 

Gly Val Ser Ser Cys Cys Ala Glu lie Arg Ser Arg Ser Tyr Leu Gly 
20 25 30 

Phe Cys Trp Lys Asn lie Gin Asp Glu Arg Lys Lys Lys lie lie Leu 
. 35 40 45 

Arg Gly Ser Arg Asn Leu Leu Cys Pro Arg Leu Leu Arg Pro Leu Glu 
50 55 60 

Pro Val Gin Ala Lys Gly Thr Gin Ser Val Asp Pro Arg Glu Val Val 
65 70 75 80 

Arg Glu Thr Arg Ser Met Ser Thr Leu Pro Ala Asp Phe Cys Leu Leu 

85_. , _ 90 . 95 

Pro Gin Ala Ser Arg Met Ala Gin Lys Gly Ser Pro Ser Arg Ser Ser 
100 105 110 



Leu Gin Leu Leu Phe Xaa 
115 



<210> 118 

<211> 65 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE - - - - - " - 

<222> (65) 

<223> Xaa equals stop translation 
<400> 118 

Met Thr Val Ser Leu Phe Leu Leu Leu Ala Thr Ser Gin Ser Gin Asp 
15 10 15 

Gly Cys Cys Asp Ser Gly Ser Cys 'Pro _Asn Ser Ar g Gin Gin G lu Gly 
7 20 " 25 30 

His Gly Ala Ala Pro Ala Ser Arg Cys Pro Cys Arg Pro Ser Leu Gin 
35 40 45 



Ala Gin Glu Pro Lys Glu Glu Ser Thr Gin Met Trp Cys Ser His Leu 
50 55 60 
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Xaa 
65 



<210> 119 

<211> 43 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (43) 

<223> Xaa equals stop translation 
<400> 119 

Met Leu Lys Trp Thr Gly Phe Leu Val Val Leu Val Ala Phe Lys Lys 
1 5 10 15 

lie Ser Ala Ser Phe Gin Val Asn Tyr Asn Leu Lys Phe Glu lie Ser 
20 25 30 

Phe Gly Glu Pro Trp Lys Phe Thr Gin Trp Xaa 
35 ' 40 



<210> 120 

<211> 48 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (48) 

<223> Xaa equals stop translation 
<400> 120 

Met Ser Phe Gly lie Ser lie His Thr Cys Thr Tyr Leu lie Phe lie 
1 5 10 15 



Ala Phe His Phe lie Ala Leu Cys Lys Val Thr Phe Phe Thr Asp Ser 
20 25 30 

Arg Phe Gly Asn Pro Met Ser He Ser Leu Ser Ala Pro Phe Phe Xaa 
35 40 45 



<210> 121 
<211> 140 
<212> PRT 
<213> Homo sapiens 
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<220> 

<221> SITE 
<222> (140) 

<223> Xaa equals stop translation 
<400> 121 

Met Ala Leu Gly lie Gin Lys Arg Phe Ser Pro Glu Val Leu Gly Leu 
1 5 10 15 

Cys Ala Ser Thr Ala Leu Val Trp Val Val Met Glu Val Leu Ala' Leu 
20 25 30 

Leu Leu Gly Leu Tyr Leu Ala Thr Val Arg Ser Asp Leu Ser Thr Phe 
35 40 45 

His Leu Leu Ala Tyr Ser Gly Tyr Lys Tyr Val Gly Met lie Leu Ser 
50 55 . 60 

Val Leu Thr Gly Leu Leu Phe Gly Ser Asp Gly Tyr Tyr Val Ala Leu 
65 '70 75 80 

Ala Trp Thr Ser Ser Ala Leu Met Tyr Phe lie Val Arg Ser Leu Arg 
85 90 95 

Thr Ala Ala Leu Gly^Pro.Asp Ser . Met Gly Gly Pro, Val Pro Arg Gin 
100 " . • "• • .105 110 

Arg Leu Gin Leu Tyr Leu Thr Leu Gly Ala Ala Ala Phe Gin Pro Leu 
115 120 125 

lie lie Tyr Trp Leu Thr Phe His Leu Val Arg Xaa 
130 135 140 



<210> 122 

<211> 92 

<212> PRT 

<213> Homo sapiens 

<220> . - - — — - --- - - 

<221> SITE 
<222> (89) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (92) 

<223> Xaa equals stop translation 



<400> 122 

Met Met Asp Phe Leu Arg Cys Val Thr Ala Ala Leu lie Tyr Phe Ala 
15 10 15 

lie Ser lie Thr Ala lie Ala Lys Tyr Ser Asp Gly Ala Ser Lys Ala 
20 25 30 
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Ala Gly Gly Ser Val Pro Asp Thr 
35 40 

Glu Met Gly Arg Glu Leu Gly Ala 
50 55 

Ser Pro Val Met His Pro lie His 
65 70 

Leu Leu Pro Ser Cys Leu Gin Leu 
85 



Arg Ala Val Cys Pro Ser Arg Ser 
45 

Ala Ala Ser Arg Glu Gin Gly Val 
60 

Pro Val His Arg Cys Leu Ala Ser 
75 80 

Xaa Ser Thr Xaa 
90 



<210> 123 
<211> 347 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (242) 

<223> Xaa equals any of the naturally occurring L-amino acids 

<220>\ _ . : " 

<221> SITE * ' - ■ - 

<222> (246) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (347) 

<223> Xaa equals stop translation 
<400> 123 

Met Arg Arg Gly Ala Gly Ala Ala Arg Gly Arg Ala Ser Trp Cys Trp 
1 5 10 15 

Ala Leu Ala Leu Leu Trp Leu Ala Val Val Pro Gly Trp Ser Arg Val 

20. - 25 ~ - - 30 " 

Ser Gly lie Pro Ser Arg Arg His Trp Pro Val Pro Tyr Lys Arg Phe 
35 40 45 

Asp Phe Arg Pro Lys Pro Asp Pro Tyr Cys Gin Ala Lys Tyr Thr Phe 
50 55 60 

Cys Pro Thr Gly Ser Pro lie Pro Val Met Glu Gly Asp As p As p Ile_ 
65 70 75"" 80 

Glu Val Phe Arg Leu Gin Ala Pro Val Trp Glu Phe Lys Tyr Gly Asp 
85 90 95 



Leu Leu Gly His Leu Lys lie Met His Asp Ala lie Gly Phe Arg Ser 
100 105 110 
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Thr Leu Thr Gly Lys Asn Tyr Thr Met Glu Trp Tyr Glu Leu Phe Gin 
115 120 125 

Leu Gly Asn Cys Thr Phe Pro His Leu Arg Pro Glu Met Asp Ala Pro 
130 135 140 

Phe Trp Cys Asn Gin Gly Ala Ala Cys Phe Phe Glu Gly lie Asp Asp 
145 150 155 160 

Val His Trp Lys Glu Asn Gly Thr Leu Val Gin Val Ala Thr lie Ser 
165 170 175 

Gly Asn Met Phe Asn Gin Met Ala Lys Trp Val Lys Gin Asp Asn Glu 
180 185 190 

Thr Gly lie Tyr Tyr Glu Thr Trp Asn Val Lys Ala Ser Pro Glu Lys 
195 200 . 205 

Gly Ala Glu Thr Trp Phe Asp Ser Tyr Asp Cys Ser Lys Phe Val Leu 
2i0 215 220 

Arg Thr Phe Asn Lys Leu Ala Glu Phe Gly Ala Glu Phe Lys Asn lie 
225 230 235 240 

Glu Xaa Asn Tyr Thr Xaa lie Phe Leu Tyr Ser Gly Glu Pro Thr Tyr 
245 250 255 

Leu Gly Asn Glu Thr Ser Val Phe Gly Pro Thr Gly Asn Lys Thr Leu 
260 265 270 

Gly Leu Ala lie Lys Arg Phe Tyr Tyr Pro Phe Lys Pro His Leu Pro 
275 280 285 

Thr Lys Glu Phe Leu Leu Ser Leu Leu Gin lie Phe Asp Ala Val lie 
290 295 300 

Val His Lys Gin Phe Tyr Leu Phe Tyr Asn Phe Glu Tyr Trp Phe Leu 
305 310 315 320 



Pro Met Lys Phe Pro Phe lie Lys lie Thr Tyr Glu Glu lie Pro Leu 
325 330 335 

Pro lie Arg Asn Lys Thr Leu Ser Gly Leu Xaa 
340 345 



<210> 124 

<2 11>_2 3.4 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (173) 
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<223> Xaa equals any of the naturally occurring L-araino acids 
<220> 

<221> SITE 
<222> (234) 

<223> Xaa equals stop translation 
<400> 124 

Met His Arg Gly Lys Leu Asp Cys Ala Gly Gly Ala Leu Leu Ser Ser 
15 10 15 

Tyr Leu lie Val Leu Met lie Leu Leu Ala Val Val lie Cys Thr Val 
20 25 30 

Ser Ala lie Met Cys Val Ser Met Arg Gly Thr He Cys Asn Pro Gly 
35 40 45 

Pro Arg Lys Ser Met Ser Lys Leu Leu Tyr lie Arg Leu Ala Leu Phe 
50 55 60 

Phe Pro Glu Met Val Trp Ala Ser Leu Gly Ala Ala Trp Val Ala Asp 
65 70 75 80 

Gly Val Gin Cys Asp Arg Thr Val Val Asn Gly He He Ala Thr Val 

85 . . ' - 90 "' . 95. 

Val Val Ser Trp He He He Ala Ala Thr Val Val Ser He He He 
100 105 110 

Val Phe Asp Pro Leu Gly Gly Lys Met Ala Pro Tyr Ser Ser Ala Gly 
115 120 125 

Pro Ser His Leu Asp Ser His Asp Ser Ser Gin Leu Leu Asn Gly Leu 
130 135 140 

Lys Thr Ala Ala Thr Ser Val Trp Glu Thr Arg He Lys Leu Leu Cys 
145 150 155 160 

Cys Cys He Gly Lys Asp Asp His Thr Arg Val Ala Xaa Ser Ser Thr 

_ 165 170 175 

Ala Glu Leu Phe Ser Thr Tyr Phe Ser Asp Thr Asp Leu Val Pro Ser 
180 185 190 

Asp He Ala Ala Gly Leu Ala Leu Leu His Gin Gin Gin Asp Asn He 
195 200 205 

Arg Asn Asn Gin Asp Leu Pro Arg* Trp Ser Ala Met Pro Gin Gly Ala 

210 21 5 2 20 : 



Pro Arg Lys Leu He Trp Met Gin Asn Xaa 
225 230 



<210> 125 
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<211> 54 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (54) 

<223> Xaa equals stop translation 
<400> 125 

Met Gin Gly Val Leu Phe Gly Phe Val Trp Leu Phe Ser Phe Leu Trp 
15 10 15 

Gin Glu Asn Lys Ser Ser Ala Ser Pro Ser Thr Leu Ala Lys Ser Gly 
20 25 30 

Ser Pro Cys Pro Val Ser lie Pro Trp Met Pro Gly Val Leu Val Arg 
35 40 45 

Phe Phe. Thr Leu Leu Xaa 
50 



<210> 126 

<211> 82 - . 

<212> PRT "~ - " " - * 

<213> Homo sapiens ' 

<220> 

<221> SITE 
<222> (44) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (82) 

<223> Xaa equals stop translation 
<400> 126 

Met Arg Met Arg Val Ala Val Ala Pro Arg Pro His Gin His Leu Val 
1 5 10 15 

Val Ser Val Ser Trp He Leu Ala He Leu He Ser Val Ser Gly Tyr 
20 25 30 

His Cys Phe His Leu Gin Phe Ser Tyr Met Val Xaa Asn He Phe Pro 
35 40 45 

-His— Val -Tyr— Leu -Ser— Ser— Ala—! Tyr—Leu— Leu- Arg— Pro-Va-1- He- Cys— Ser- 

50 55 60 

Asp Leu Leu Pro Val Phe Val Cys Leu His Val Cys Leu Cys Leu He 
65 70 75 80 

Phe Xaa 
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<210> 127 
<211> 42 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (42) 

<223> Xaa equals stop translation 
<400> 127 

Met Gly Trp Glu Ala Ala Leu Ala Leu Leu Val Ser Ala Val Phe Phe 
1 ""5 10 15 

Pro Trp Cys Thr lie Gin Arg Pro Asp Val Gly Thr Thr Ser Pro Gly 
20 25 30 

Gly Leu Glu Arg Arg Ser Lys Gly Phe Xaa 
35 40 



:<210> 128 - ' .. "\ - 

<2ii> 66 ; ■ "* - ' ' 

<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (66) 

<223> Xaa equals stop translation 
<400> 128 

Met Thr Phe Met lie Leu Lys Phe Phe Phe Leu Cys Gly Phe Val Leu 
15 10 15 

Asn Arg Leu lie Ala Arg Gin Leu Ala Lys lie His Ala lie His Ala 

20 25 _ 30 

His Asn Gly Trp lie Pro Lys Ser Asn Leu Trp Leu Lys Met Gly Lys 
35 40 45 

Tyr Phe Ser Leu lie Pro Thr Gly Phe Ala Asp Glu Asp lie Asn Lys 
50 55 60 

Arg Xaa 



<210> 129 
<211> 50 
<212> PRT 

<213> Homo sapiens 
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<220> 

<221> SITE 
<222> (50) 

<223> Xaa equals stop translation 
<400> 129 

Met lie Val Asn His Phe Ser Phe Leu Phe Cys Trp lie Val Phe Cys 
1 5 10 15 

Phe Leu Leu Gin His Ser Cys Phe Cys Cys Ala Tyr Phe Trp Ser Phe 
20 25 30 

Asp Ser Leu Cys His Cys Phe Leu Ser His Thr Pro Leu Arg Phe Thr 
35 40 45 

Gin Xaa 
50 



<210> 130 
<211> 227 
<212> PRT 
<213> Homo sapiens 

<220> "~ 
<221> SITE 
<222> (227) 

<223> Xaa equals stop translation 
<400> 130 

Met Glu Thr Val Val lie Val Ala He Gly Val Leu Ala Thr He Phe 
1 5 10 15 

Leu Ala Ser Phe Ala Ala Leu Val Leu Val Cys Arg Gin Arg Tyr Cys 
20 25 30 

Arg Pro Arg Asp Leu Leu Gin Arg Tyr Asp Ser Lys Pro He Val Asp 
35 40 45 

Leu He Gly Ala Met Glu Thr Gin Ser Glu Pro Ser Glu Leu Glu Leu 
50 55 60 

Asp Asp Val Val He Thr Asn Pro His He Glu Ala He Leu Glu Asn 
65 70 75 80 

Glu Asp Trp He Glu Asp Ala Ser Gly Leu Met Ser His Cys He Ala 
85 • 90 95 



He Leu Lys He Cys His Thr Leu Thr Glu Lys Leu Val Ala Met Thr 

100 105 110 

Met Gly Ser Gly Ala Lys Met Lys Thr Ser Ala Ser Val Ser Asp He 

115 120 125 
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He Val Val Ala Lys Arg He Ser Pro Arg Val Asp Asp Val Val Lys 
130 135 140 



Ser Met Tyr Pro Pro Leu Asp Pro Lys Leu Leu Asp Ala Arg Thr Thr 
145 150 155 160 

Ala Leu Leu Leu Ser Val Ser His Leu Val Leu Val Thr Arg Asn Ala 
165 170 175 

Cys His Leu Thr Gly Gly Leu Asp Trp He Asp Gin Ser Leu Ser Ala 
180 185 190 



Ala Glu Glu His Leu Glu Val Leu Arg Glu Ala Ala Leu Ala Ser Glu 
195 200 205 

Pro Asp Lys Gly Leu Pro Gly Pro Glu Gly Phe Leu Gin Glu Gin Ser 
210 . 215 220 



Ala He Xaa 
225 



<210> 131 
<211> 118 
<212> PRT 
<213> Homo 



sapiens 



<220> 

<221> SITE 
<222> (118) 

<223> Xaa equals stop translation 



<400> 131 

Met Gin Arg He Ala Ser Leu Leu 
1 5 

Ala Ala Gly Ser Thr Pro Ala Glu 
20 

Ser Leu, Ser Ala Thr ._Pr.o_.Ser Leu. 

35 40 



Thr Leu Leu Thr ■ Gin Leu Thr Leu 
10 15 

Thr He Ser Asp Ser Ala Glu Ala 
25 30 

Val Thr Trp Thr- Gin -Val- Ser Gly 
45- 



Leu Gin Pro Leu Val Glu Pro Cys 
50 55 

Ser Arg Pro Glu Met Trp Arg Ala 
65 70 

Leu -Leu— Phe— Leu-Gly-A-la-Tyr-Tyr 
85 

Ser Cys Pro Glu Asp Trp Leu Gin 
100 



Leu Arg Gin Thr Leu Lys Leu Leu 
60 

Val Gly Pro Val Pro Val Ala Cys 
75 80 

-Gin™ Ala - Trp - Ser _ Gln~ Gl~n — Pro - Ser 
90 95 

Asp Met Glu Arg Leu Ser Glu Ser 
105 110 



Cys Cys Cys His Cys Xaa 
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115 



<210> 132 
<211> 306 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (180) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (197) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (306) 

<223> Xaa equals stop translation 
<400> 132 

Met Ser Glu Asp Arg_ Pro. Met Leu. Gin Phe Leu Leu His Thr Ser. Phe 
1 5 ""- 10 15 

Leu Ser Pro Leu Phe lie Leu Trp Leu Trp Thr Lys Pro lie Ala Arg 
20 25 30 

Asp Phe Leu His Gin Pro Pro Phe Gly Glu Thr Arg Phe Ser Leu Leu 
35 40 45 

Ser Asp Ser Ala Phe Asp Ser Gly Arg Leu Trp Leu Leu Val Val Leu 
- 50 55 60 

Cys Leu Leu Arg Leu Ala Val Thr Arg Pro His Leu Gin Ala Tyr Leu 
65 70 75 80 

Cys -Leu Ala Lys Ala— Arg Val Glu Gin Leu Arg Arg- Glu Ala Gly- Arg - 
85 90 95 

lie Glu Ala Arg Glu lie Gin Gin Arg Val Val Arg Val Tyr Cys Tyr 
100 105 110 

Val Thr Val Val Ser Leu Gin Tyr Leu Thr Pro Leu lie Leu Thr Leu 
115 120 125 



Asn Cys Thr Leu Leu Leu Lys Thr Leu Gly Gly Tyr Ser Trp Gly Leu 
130 135 140 

Gly Pro Ala Pro Leu Leu Ser Pro Arg Pro lie Leu Ser Gin Arg Cys 
145 150 155 160 

Pro His Arg Leu Trp Gly Gly Arg Ser Pro Ala Asp Cys Ser Ala Asp 
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165 170 175 

Cys Arg Gly Xaa Gly Trp Pro Ala Tyr Ser Pro Leu Pro Pro Trp Arg 
180 185 190 

Pro Gly Leu Pro Xaa Leu Val Asp Gly Cys Leu Pro Ala Ala Arg Gin 
195 200 205 

Pro Phe Arg Pro Leu Leu Pro Pro Ala Leu Gly Arg Leu Leu Ala Ala 
210 215 220 

Cys Arg Pro Ser Trp Gly Pro Glu Val Cys Ser Trp Gly Ser Gly Thr 
225 230 235 240 

Leu Ala Cys Pro Leu Cys Leu Arg Pro Arg Val Pro Ser Cys Lys Val 
245 250 255 

Gly Pro Asp Ser Pro Ala Phe Pro Ser Pro Gin Cys Leu Thr Arg Gly 
260 265 270 

Pro Pro Trp Thr Pro Ser Phe Cys Leu Arg Thr Val Ser Pro Gly Pro 
275 280 285 

Ser Ser Met Arg Val Pro Arg Pro Leu Ser Pro Lys Arg Met Cys Gin 
290 - _ ,295 ' ; _ 300 

Val Xaa 
305 



<210> 133 
<211> 45 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (45) 

<223> Xaa equals stop translation 



<400> 133 

Met Ser Tyr Ser Leu Phe Leu Ala Leu Leu Ser Phe Ala Ser Ala lie 
1 5 10 15 

Leu Phe Val Ala Gly Thr lie Ala Gly Thr Gly Gly Leu Ser Phe His 
20 25 30 

Gly lie Ala Thr lie Phe Val Leu -Thr Gly Lys Trp Xaa 
35— 40 45 



<210> 134 

<211> 44 

<212> PRT * 

<213> Homo sapiens 
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<220> 

<221> SITE 
<222> (6) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (44) 

<223> Xaa equals stop translation 
<400> 134 

Met Gly Arg Leu Gly Xaa Gin Cys Leu Leu Phe Leu Ala Phe Lys Ala 
15 10 15 

lie Ser Gly Val Phe Phe Leu Phe Trp Arg Pro Ala Asp Ser Thr Glu 
20 25 30 

Arg Asn Thr Gin Ser Trp Asp Phe Pro Pro Leu Xaa 
35 40 



<210> 135 

<211> 50 

<212>" PRT 

<213> Homo sapiens 



<220> 

<221> SITE 
<222> (50) 

<223> Xaa equals stop translation 
<400> 135 

Met Gly Val Gly Val Leu Arg lie Leu Leu Ser Cys Leu Gly Glu Ala 
1 5 10 15 

Ala Pro Lys Ser Ala Gly Thr Ser Leu Glu Ser Ala Lys Glu Cys Trp 
20 25 30 

Ser Ala Ala Thr Leu Leu . Val" Leu ' Cys Val" Leu Cys" Gin Leu Gin" "His" 
35 40 45 

Gly Xaa 
50 



<210> 136 

<211> _81 . -_ 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (81) 

<223> Xaa equals stop translation 
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<400> 136 
Met Glu Ser Leu 
1 

Ser Leu Leu Ala 
20 

Asn Ser Gin Phe 
35 

lie Ala Gin Val 
50 

Arg Val Leu Gin 
65 

Xaa 



Pro Glu Asn Lys 
5 

lie He Gly Leu 

Gly Leu Val Asp 
40 

Leu Leu Leu Asp 
55 

Phe Phe Leu Gly 
70 



Pro Leu Val Trp 
10 

Leu Leu Gly Ser 
25 

He Pro Val Glu 



Phe Cys Leu Ala 
60 

Thr Pro Lys Leu 
75 



Ser Leu Ala Val 
15 

Ser Pro Asp Phe 
30 

Phe Lys Leu Val 
45 

Leu Leu Ala Asp 



Lys Val Pro Ser 
80 



<210> 137 
<211> 277 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (94) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (103) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (277) 

<223>- Xaa equals stop translation ~ 

<400> 137 

Met He His Val Asn Arg Asn He Met Asp Phe Lys Leu Phe Leu Val 
15 10 15 

Phe Val Ala Gly Val Phe Leu Phe Phe Tyr Ala Arg Thr Leu Glu Ser 
20 25 30 



Lys Pro Tyr Phe Leu Leu Leu Leu Gly Asn Cys Ala Arg Cys Ser Asn 
35 40 45 

Asp He Val Phe Val Leu Leu Leu Val Lys Arg Phe He Arg Ser He 
50 55 60 

Ala Pro Phe Gly Ala Leu Met Val Gly Cys Trp Phe Ala Ser Val Tyr 
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65 70 75 80 

lie Val Cys Gin Leu Met Glu Asp Leu Lys Trp Leu Trp Xaa Glu Asn 
85 90 95 

Arg lie Tyr Val Ser Gly Xaa Val Leu lie Val Gly Phe Phe Ser Phe 
100 105 110 

Val Val Cys Tyr Lys His Gly Pro Leu Ala His Asp Arg Ser Arg Ser 
115 120 125 

Leu Leu Met Trp Met Leu Arg Leu Leu Ser Leu Val Leu Val Tyr Ala 
130 135 140 

Gly Val Ala Val Pro Gin Phe Ala Tyr Ala Ala lie lie Leu Leu Met 
145 150 155 160 

Ser Ser Trp Ser Leu His Tyr Pro Leu Arg Ala Cys Ser Tyr Met Arg 
165 170 175 

Trp Lys Met Glu Gin Trp Phe Thr Ser Lys Glu Leu Val Val Lys Tyr 
180 185* 190 

Leu Thr Glu Asp Glu Tyr Arg Glu Gin Ala Asp Ala Glu Thr Asn Ser 
195 20(K - "'- 205 . 

Ala Leu Glu Glu Leu Arg Arg Ala Cys Arg Lys Pro Asp Phe Pro Ser 
210 215 22Q 

Trp Leu Val Val Ser Arg Leu His Thr Pro Ser Lys Phe Ala Asp Phe 
225 230 235 240 

Val Leu Gly Gly Ser His Leu Ser Pro Glu Glu lie Ser Leu His Glu 
245 250 255 

Glu Gin Tyr Gly Leu Gly Gly Ala Phe Leu Glu Glu Gin Leu Phe Asn 
260 265 270 

Pro Ser Thr Ala Xaa 

—275 " — • " ~ 



<210> 138 

<211> 57 

<212> PRT 

<213> Homo sapiens 

<220> • 

<2"2T>~~S'ITE 
<222> (57) 

<223> Xaa equals stop translation 
<400> 138 

Met Cys Gin Thr Leu Pro Ala Arg Leu Arg Ala Gin Cys lie Ser Ser 
15 10 15 
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Leu Leu Phe Leu Leu Met Gly Leu Leu Ala Met Thr Gly Glu Arg Asn 
20 25 30 

Gin Gly Thr His Tyr Tyr Glu Phe Ser Gly Phe lie Phe Lys Ser Gin 
35 40 45 

Met Met Trp Ser lie Lys Pro Asn Xaa 
50 55 



<210> 139 

<211> 71 

<212> PRT 

<:213> Homo sapiens 

<220> 

<221> SITE 
<222> (71) 

<223> Xaa equals stop translation 
<400> 139 

Met Tyr Leu Trp Phe Ser Phe Ser Thr Val Gly Leu Cys Gly Cys Cys 
15 10 15 

Leu Leu Tyr Arg Ala Cys Gly * Phe lie Trp Tyr Leu Leu Leu Leu Gly 
20 25 30 

His Ser Ser Thr Asn Ser Leu Gin Asp Gly Gly Ala Glu Arg Pro Glu 
35 40 45 

His Pro Trp Ala His Val Arg Tyr Ser Cys Arg Arg Glu Leu Ser Phe 
50 .55 60 

Trp Phe Tyr Val Phe Asn Xaa 
65 70 



<210> 140 

<211>--63- - - 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (63) 

<223> Xaa equals stop translation 



-<400>— 140 

Met Glu Pro Glu Ser Trp Ala Leu Cys Leu Leu Leu Phe Leu Gly Thr 
1 5 10 15 

Ala Leu Gly Tyr Pro Pro Leu Pro Arg His Ser Ser Lys Cys Glu lie 
20 25 30 
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Leu Glu Val Arg Leu His Leu Leu Pro Leu Leu lie Asn lie Gly Met 
35 40 45 

Met Ser Pro Val Ala Ser Pro Phe Val Cys Ser lie Thr Gly Xaa 
50 55 60 



<210> 141 
<211> 89 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (89) 

<223> Xaa equals 'stop translation 
<400> 141 

Met Leu Phe Leu Ser Ala Ser lie Cys Thr Ser Ala Leu Phe Leu Cys 
1 5 10 15 

Leu Ser Arg Leu Thr lie Ser Ala Pro His Pro Ala Trp Trp Gly Arg 
20 25 30 

Met Pro Thr His Thr Ser Pro Gly His Leu Leu Glu Leu Gin Pro Arg 

.35 ~- " " . 40; " '■ * -45 ■ 

Gly Met Thr Glu Ser lie Leu Phe Ser lie Ser Ala Leu Val Ser Asn 
50 55 60 

Ser Trp Gly Lys Met Thr Gin Leu Thr Ser Gly Ser His Ser Trp Ser 
65 70 75 80 

Ser Gly Leu Gin Asn Phe Gin Ala Xaa 
85 



<210> 142 
<211> 46 

<212> _PRT _. _ __ 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (46) 

<223> Xaa equals stop translation 
<400> 142 

-Met— Arg— Pro-Val— Cys— Ser— Leu"Gly-Trp- ATa - Gly Trp - Prb~GIy~~ Leu~VaT~ 
15 10 15 



Cys Gly Leu Arg Ala Leu Leu Gly Pro Ser Leu Phe Pro Val Thr Phe 
20 25 30 

Gly Ala Thr Glu Ala Val His Ser Leu Asp Val Cys Ser Xaa 
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35 



40 



45 



<210> 143 
<211> 56 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SITE 
<222> (56) 

<223> Xaa equals stop translation 
<400> 143 

Met Val Asn Glu Lys Glu Ala Arg Thr Gly Ser Pro Lys Ser Trp Leu 
1 5 10 15 

Leu Cys Leu Ala Leu Leu Leu lie Lys Tyr Val Thr Phe Cys Lys Pro 
20 25 30 

Tyr Leu Thr Lys Pro Tyr Phe Leu His Leu Ser Val Leu Asp Gin Leu 
35 40 45 



Ser Pro Gly Thr Pro Leu Asp Xaa 
"50 . 55 



<210> 144 

<211> 77 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (77) 

<223> Xaa equals stop translation 



<400> 144 

Met Phe lie Ala lie Tyr Phe Lys Ala Phe His Gly Ser Phe Gin Leu 

1 5" 10 " 15 

Cys Thr Trp Leu Val lie Met lie Val lie Leu Gly Gin Ser Phe Ser 
20 25 30 

Ala Leu Ala Leu Leu Thr Phe Trp Leu lie Leu Gys Cys Arg Gly Cys 
35 40 45 

_Pro_J/a2_Jiis_Cyj3„ ,Leu_Leu_ 
50 55 60 

Asn Ala Arg Ser Asn Thr Val Pro Pro Ala Gin Leu Xaa 
65 70 75 



<210> 145 
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<211> 43 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (43) 

<223> Xaa equals stop translation 
<400> 145 

Met Phe Phe Leu Ser Met Phe Leu His lie Val Leu Leu His Cys Gly 
1 5 10 15 

Asn Ser Phe Tyr Lys lie Cys His Ser Trp Asp Tyr Ala Ala Leu Gin 
20 25 30 

Glu Ser Thr Arg Phe Tyr Ser Asn Ser Tyr Xaa 
35 40 



<210> 146 

<211> 102 

<212> PRT 

<213> Homo sapiens 

<220> * ■ 

<221> SITE 
<222> (67) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (102) 

<223> Xaa equals stop translation 
<400> 146 

Met Glu Leu Glu Arg Cys Ser Val Val Leu Cys lie Leu Ala Asn Leu 
1 5 .10 15 



Ala Val Leu Arg Ala Leu Phe Leu Pro Cys lie lie Phe His Cys Val 
20 25 30 

Ser Asp Ser Arg Ser Val Asn Arg Glu Thr Lys Val Lys Phe .Val His 
35 40 45 

Thr Ser Val His Gly Val Gly His Ser Phe Val Gin Ser Ala Phe Lys 
50 55 60 



Ala Phe Xaa Leu Val Pro Pro Glu Ala Val Pro Glu Gin Lys Asp. Pro 
65 70 75 80 

Asp Pro Glu Phe Pro Thr Val Lys Tyr Pro Asn Pro Glu Glu Gly Lys 
85 90 95 

Gly Val Leu Val Thr Xaa 
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100 



<210> 147 
<211> 134 
<212> PRT 
<213> Homo sapiens 



<220> 

<221> SITE 
<222> (134) 

<223> Xaa equals stop translation 
<400> 147 

Met Arg Val Pro Leu Val Leu Ser Trp Ala Phe Val Leu Val Gly Phe 
1 5 10 15 



Ser Gly Val Tyr Leu Ala Ser Glu Ser Phe Trp Phe Pro Pro Ser Leu 
20 25 30 

Cys Asp Leu Thr Ser Pro Pro Gly Leu His Leu Trp Lys Phe lie Arg 
35 40 45 

Asp Leu Val Ser Met Glu Glu Leu Thr Asp Ser Ala Arg Glu Met Gly 

50 _ . 55 ' ~ _ . 60" 

Tyr Trp Met Met Val Phe Ser Leu Lys Ala Met Phe Pro Val Ser Ser 
65 70 75 80 

Gly Cys Phe Gin Glu Arg Gin Glu Thr Asn Lys Ser Leu Thr Leu Leu 
85 90 95 

Arg Cys Ser Gin Arg Asp Thr Ser Pro Leu Met Asp Gly Gin Thr Trp 
100 105 110 

Ala Arg Val Arg Val Thr Lys Pro Pro Thr Thr Ala Thr Ala Ala Tyr 
115 120 125 

Asn Arg His lie Arg Xaa 

130 ' ■ " ' " " 



<210> 148 
<211> 50 
<212> PRT 

<213> Homo sapiens 
<220> 



<221> SITE 
<222> (50) 

<223> Xaa equals stop translation 
<400> 148 

Met Lys Ser Leu Phe Cys He Tyr Phe Leu Arg Trp Pro Met Gly Leu 
15 10 15 
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Ser Trp Gly Glu Thr Phe lie Leu Leu Arg Asp Ser Leu Ala lie Asn 
20 25 30 

Phe Gin Ser Phe Ser Lys Ala Ala Ser Gly Asp lie Phe Gly Cys His 
35 40 45 

Asp Xaa 
50 



<210> 149 

<211> 64 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (6) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (64) 

<223> Xaa equals stop translation 
<400> 149 

Met Ser Cys Gly Leu Xaa Phe Gly Pro Trp Phe Val Pro Met Leu Leu 
1 5 10 15 

Met Ser His Ser Leu Leu Pro Ser Trp Ser Gly Leu Trp Val Thr Thr 
20 25 30 

Trp Asn Gly Ser Ser Gly Glu Arg Thr Pro Ser Pro Trp Arg Arg Lys 
35 40 45 



Arg Ala Ser Gin Ser Ala Gly Arg lie Ala Ser Trp Met Ser Phe Xaa 
50 55 60 



<210> 150 

<211> 75 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> SITE 
<222> (59) 

<22 3> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
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<222> (75) 

<223> Xaa equals stop translation 



<400> 150 

Met Leu Ser Ser Pro Asn Leu Ala 
1 5 

Ser Gly Ser Ala Thr Asn Trp Ala 
20 

Ser Arg Cys Gly Trp Lys Val Ser 
35 40 

Ser Ser Ala Leu Trp Val Ser Cys 
50 55 

Pro Gly Gly Arg Glu Pro Arg His 
65 70 



Ala Ser Leu Leu Cys Leu Trp His 
10 15 

Pro Pro Cys Ala Gly Met Trp Ala 
25 30 

Pro His Pro Glu Ala Gly Pro Cys 
45 

Cys Val Xaa Ala Glu Gin Pro Gin 
60 

Arg Gly Xaa 
75 



<210> 151 

<211> 55 

<212> PRT 

<213> Homo sapiens 

<220> ~~ - - . " -"'*■" 

<221> SITE 
<222> (55) 

<223> Xaa equals stop translation 
<400> 151 

Met Pro His lie Ser Phe Cys Leu Gly Thr Pro Tyr Val Val Ala Val 
1 5 10 15 



Tyr Leu Pro Ala Trp lie Val Met Leu Leu Leu Pro Gly Val Arg Pro 
20 25 30 

Tyr Ser Ser Leu Gin Ala Leu Lys His Pro Ser Cys Ser Ser Ser Ser 
35 40 45 

Val Cys Ala Pro Tyr Met Xaa 
50 55 



<210> 152 

<211> 58 

<212> PRT 

< 2 1 3 > Homo sapiens 



<220> 

<221> SITE 
<222> (58) 

<223> Xaa equals stop translation 
<400> 152 
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Met Gly Leu Asn lie Ser Pro Trp Cys Phe Leu Ala lie Leu Thr Cys 
1.5 10 15 

Ala He Ser Ala Ala Phe He Ser Val Gly Val Val Cys Trp Leu Leu 
20 25 30 

Phe Leu He Ser His Arg Ser Ser Lys Asn Leu Arg Lys Ser Arg Val 
35 40 45 

Arg Gly Val Trp Glu Asn Glu Glu He Xaa 
50 55 



<210> 153 
<211> 53 
<212> PRT 

<213> Homo sapiens 

<220> • 
<221> SITE 
<222> (53) 

<223> Xaa equals stop translation 
<400> 153 

Met Ala Tyr Val Leu .Ala Val Leu Gys.Phe Lys Ser Leu. Trp Ala Leu 
1 5 * ■ • 10 15 

Phe Lys Pro Asn Lys Gin Leu He Glu Phe Leu Leu Met Val Lys Val 
20 25 30 

Val Lys He Pro Leu Cys Tyr Leu Arg Gin Leu Leu Gly Gly He Lys 
35 40 45 

Thr Pro Arg Val Xaa 
50 



<210> 154 
<211> 51 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (51) 

<223> Xaa equals stop translation 

< 400> 1 54 I 

Met Asp Gly Gly Pro Gly Ala Phe Ser Arg Ala Trp Val Leu Gin He 
15 10 15 

Pro Trp Leu Leu Leu Ser Gly Gly Asn Phe Ala Leu Cys Glu Pro Arg 
20 25 30 

Pro Cys Pro Ser Ala Gly His Pro Trp Gin Glu Ala Gly Leu Pro Ser 
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35 



40 



45 



Ser Pro Xaa 
50 



<210> 155 

<211> 67 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (55) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (67) 

<223> Xaa equals stop translation 
<400> 155 

Met Pro Phe Leu Ser Val Trp Phe Phe Asn Leu 
15 10 

Val Glu Ser Phe Val Leu : Arg Ala Val Leu Phe lie Ala Gly Cys Ser 
20 25 30 

Ala Thr Ser Gin Met Glu Ala Ala Ser Pro Tyr Pro Ala Val Thr Lys 
35 40 45 

Arg Lys Lys Asn Val Ser Xaa His Cys Gin lie Ser Ser Gly Gly Ala 
50 55 60 

Pro Gly Xaa 
65 



Gly Leu lie Phe Gly 
15 



<210> 156 

<211>-49 - — 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (49) 

<223> Xaa equals stop translation 



"<40~0>~T5~6 

Met Leu Leu Lys Arg Asn Leu Leu lie Leu lie Leu Phe Leu Val Thr 
15 10 15 

Cys Phe Asn Phe Val Ser Phe Phe Phe Phe Pro Trp Lys Leu Leu Gly 
20 25 30 
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Ser Pro Phe Tyr Pro Cys Ser Leu Arg Ser Asp Asn Asp Gly Cys Val 
35 40 45 

Xaa 



<210> 157 

<211> 61 

<212> PRT 

<213> Homo sapiens 

<400> 157 

Met Gly Ser Phe Leu His Pro Gin Trp His Leu Leu lie Thr Phe Cys 
1 .5 10 15 

Ala Val Leu Gly Lys Gly Leu His Ser Asp Pro Ser Arg Pro Phe Glu 
20 25 30 

His Gly Gly Ala Leu Gly Lys Val Pro Arg Gly Arg Ser Thr Leu Leu 
35 40 45 

Ser Lys Glu Val Leu Leu Lys Lys Lys Lys Lys Lys Arg 
50 55 60 



<210> 158 
<211> 118 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (113) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (118) 

<223> Xaa equa 1 s"s top " trans lation 
<400> 158 

Met Leu Leu Trp Trp Gin Cys Leu Cys Cys His Ala Val Leu Glu Pro 
15 10 15 

Ala Ala Thr Ala Met Pro Glu Asp Ala Ala Pro Ser Ser Leu Pro Val 
20 25 30 



Pro Pro Asn Met Thr Ser Ser Arg Phe His Tyr Phe Trp Thr Leu Leu 

35 40 45 

Gin lie Lys Leu Thr Gin Phe Tyr Ser Lys Pro Arg Ser Leu Ser Ala 

50 55 60 

Thr Pro Glu Lys Asn lie Gly Leu Gin Glu Pro Glu Arg Arg Glu Arg 
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65 70 75 80 

Phe Thr Gly Glu Ser Cys Arg Trp Glu Leu Lys Aia Lys Ser Cys Leu 
85 90 95 

Cys Pro Thr Arg Asn Ser Leu Gly Cys Thr Gin Cys His Cys Asp Gly 
100 105 110 

Xaa Lys lie Cys Asn Xaa 





115 


<210> 


159 


<211> 


151 


<212> 


PRT 


<213> 


Homo sapiens 


<220> 




<221> 


SITE 


<222> 


(151) 


<223> 


Xaa equals stop 


<400> 


159 



Met -Leu Ala Val Leu Ala Phe Pro Val Gly Val Phe Val Val Ala Val 

1 -5 .-10 15 . 

Phe Trp lie lie Tyr Ala Tyr Asp Arg Glu Met lie Tyr Pro Lys Leu 
20 25 30 

Leu Asp Asn Phe lie Pro Gly Trp Leu Asn His Gly Met His Thr Thr 
35 40 45 

Val Leu Pro Phe lie Leu lie Glu Met Arg Thr Ser His His Gin Tyr 
50 55 60 

Pro Ser Arg Ser Ser Gly Leu Thr Ala lie Cys Thr Phe Ser Val Gly 
65 70 75 80 

Tyr -I-le Leu Trp-Val Cys Trp-Val His His— Val Thr Gly-Met Trp Val _ 
85 90 95 

Tyr Pro Phe Leu Glu His lie Gly Pro Gly Ala Arg He He Phe Phe 
100 105 110 

Gly Ser Thr Thr He Leu Met Asn Phe Leu Tyr Leu Leu Gly Glu Val 
115 120 125 



I^u~Xsn ~Ash~~ Tyr~Ile~Trp~Asp~Thr~Gln-Lys-Ser -Me t-Glu-Glu Glu-Lys- 
130 135 140 

Glu Lys Pro Lys Leu Glu Xaa 
145 150 



<210> 160 
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<211> 92 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (92) 

<223> Xaa equals stop translation 
<400> 160 

Met Gly Asp Lys Leu Gly Met Ala Arg Ala Pro Ser Val Ala Leu Ala 
15 10 15 

Gin Leu Trp Leu lie Cys Leu Cys Pro Glu Ser Leu Ala Ser Phe Val 
20 . 25 30 

Gin Ala Val Pro Trp Lys Val Leu Gin Pro Ser Ser Asn Arg Ser Thr 
35 40 45 

Asp Cys Ser Pro His Met Arg Pro Thr Cys Glu Thr Leu Gly Ser Arg 
50 55 60 

Lys Ala Gin Asp Leu Val Leu Asp Thr Met Cys Leu Ser Thr Asp Asp 

65 70 75 „_ 80 

Cys Gin Gly Leu lie Cys' Arg Giy His Arg Ser Xaa 
85 90 



<210> 161 

<211> 42 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (42) 

<223> Xaa equals stop translation 
<400> 161 

Met Gin Val Ala Cys Val Met Lys Val Ser Ala Gin Trp Val Cys Phe 
15 10 15 

Phe Val Val Phe Ser Pro Leu Cys Ser Ser Val Lys Cys Ala Ser Ser 
20 25 30 

Gly Gin Asn Arg Gly Arg Gly Asp Gin Xaa 

3 5 40- = 



<210> 162 

<211> 78 

<212> PRT 

<213> Homo sapiens 



WO 99/47540 



PCT/US99/05804 



97 



<220> 

<221> SITE 
<222> (78) 

<223> Xaa equals stop translation 



<400> 162 

Met Met Leu Gin lie lie His Leu Asn Thr Leu lie Lys Phe Phe Gin 
15 10 15 

Cys Leu Lys Leu Phe Leu His Gly Thr Ala Gly Ser Gly Gin Lys Cys 
20 25 30 

Leu Ala Tyr Lys Phe Ser Gin Phe Pro Ser lie lie Pro Ala Ala His 
35 40 45 

Lys Lys Val His His Leu Leu Ser Pro Lys Cys Leu Pro Thr Glu Cys 
50 55 60 



Ser Gin Ala Asp Asn Ser Ser Trp Asp Ser Ala Val Trp Xaa 
65 70 75 



<210> 163 
<211> 55 

<212> PRT _ " " 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (55) 

<223> Xaa equals stop translation 



<400> 163 

Met Lys Arg Leu Trp Cys Leu Ser Trp Val Pro Gly Leu Gin Gly Ser 
15 10 15 

Pro Ser Val Leu Ser Ser Val Phe Phe Ser Val Phe Lys Pro Gin Leu 
20 25 30 



His Trp Thr Cys Ser Gin Val Ser Ser His Trp His Pro Pro Cys Leu 
35 40 45 



Phe lie Leu Phe Ser Gly Xaa 
50 55 



<210> 164 

<211>^9_0 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> SITE 
<222> (90) 

<223> Xaa equals stop translation 
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<400> 164 

Met Lys Phe Leu Leu Ala Ala Leu 
1 5 

Ser Ser Gin Tyr lie Lys Trp lie 
20 

Ser Glu Phe Ser Phe Val Leu Gly 
35 40 

lie Ser Arg Glu Val Tyr Leu Leu 
50 55 



Val Leu Ser Leu lie Leu Pro Arg 
10 15 

Val. Ser Ala Gly Leu Ala Gin Val 
25 30 

Ser Arg Ala Arg Arg Ala Gly Val 
45 

lie Leu Ser Val Thr Thr Leu Ser 
60 



Leu Leu Leu Ala Pro Val Leu Trp Arg Ala Ala lie Thr Arg Cys Val 
65 70 75 80 

Pro Arg Pro Glu Arg Arg Ser Ser Leu Xaa 
85 90 



<210> 165 

<211> 45 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> SITE 
<222> (45) 

<223> Xaa equals stop translation 



<400> 165 

Met Phe Val Trp His Leu Lys Val Met Val Met Phe lie lie Leu Tyr 
1 5 10 15 

Phe Ala Tyr Cys Glu Ser Asn Phe His Ser Val Leu Ser Val Ser Lys 
20 25 30 

Pro Leu Leu Lys lie Leu Phe Leu Pro Arg Asn Leu Xaa 
35 40" 45 



<210> 166 

<211> 45 

<212> PRT 

<213> Homo sapiens 

< 2 20 > - . 

<221> SITE 
<222> (45) 

<22 3> Xaa equals stop translation 
<400> 166 

Met Thr Pro Gly Cys Ser Val Pro Phe Leu Leu Cys Trp Leu Phe Ala 
15 10 15 
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Leu Met Met Gin Glu Lys Trp Gly Gly Val Lys Ser Leu Val Ser Tyr 
20 25 30 

His Tyr Ser Arg Gin Trp His Gin Thr Val Val Val Xaa 
35 40 45 



<210> 167 
<211> 66 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (66) 

<223> Xaa equals, stop translation 
<400> 167 

Met Ser lie Ala Leu Arg lie Asn Arg Leu His Phe Trp Val Leu Leu 
1 5 10 15 

Phe Phe Phe Phe Phe Ala Gin Leu Ser Leu Ser Val Asp Leu His Gly 
20 25 30 

Thr Ser Tyr Ser Leu Lys Ser* Leu Ser Tyr Leu Thr lie Phe Leu Asp 
35 40 45 

Leu Glu Lys Leu Asp Val Gly Pro Tyr Glu Lys lie lie Arg Asn Gin 
50 55 60 

lie Xaa 
65 



<210> 168 
<211> 62 
<212> PRT 

1? JL? > Homo sapiens 



<220> 

<221> SITE 
<222> (62) 

<223> Xaa equals stop translation 
<400> 168 

Met Gin Leu Thr Leu Gly Gly Ala Ala Val Gly Ala Gly Ala. Val Leu 

1 , 5 : 10 15 



Ala Ala Ser Leu Leu Trp Ala Cys Ala Val Gly Leu Tyr Met Gly Gin 
20 25 30 

Leu Glu Leu Asp Val Glu Leu Val Pro Glu Asp Asp Gly Thr Ala Ser 
35 40 45 
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Ala Glu Gly Pro Asp Glu Ala Gly Arg Pro Pro Pro Glu Xaa 
50 55 60 



<210> 169 
<211> 47 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (47) 

<223> Xaa equals stop translation 
<400> 169 

Met His Thr Ala Lys Met Ser Leu Leu Asn Ser Val Cys Leu Leu Val 
15 10 15 

Leu Ser lie Trp Tyr Val Val Lys Phe Pro Met Met Arg Asp Ser Thr 
20 25 30 

lie Asn Val Pro Tyr Leu Leu Arg Leu Lys Ala lie Thr Thr Xaa 
35 40 45 



<210> 170 
<211> 106 
<212> PRT 
< 2 1 3 > Homo s ap i ens 

<220> 

<221> SITE 
<222> (69) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (106) 

<223> Xaa equals stop translation 



<400> 170 

Met Ser Gly Leu Ala Ala Ala Ala His Val Phe Arg Val Cys Leu Phe 
1 5 10 15 

Pro Leu Ser Trp Gly Ser . Ser Lys Thr Thr Phe lie His Gly Leu Ser 
20 25 30 

Se r Tyr lie _Ala_ Thr Pro V al L eu -Asn Ser lie Ph e Ser Ser_JTrp_Lys_ 
35 40 45 

Ser Arg Arg Lys Asp Thr Trp Thr Cys Leu Leu His Arg Leu Ser Ala 
50 55 60 

Phe Pro lie Ser Xaa Arg Arg Arg Asn Phe Ala Leu Phe Ser His Ser 
65 70 75 80 
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Cys Val Cys lie Arg Ser Ser Ser Asp Asp Val Gly Pro Thr Met Tyr 
85 90 95 

Ser Phe Ser Val Pro Cys Arg Val Lys Xaa 
100 105 



<210> 171 

<211> 45 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (45) 

<223> Xaa equals stop translation 
<400> 171 

Met His Leu Leu Thr Leu Phe Ser Ser Gly Leu lie Phe Leu Gly Cys 
15 10 15 

Ser Thr. Pro Leu Ser Phe Cys Asp Cys Leu Pro lie Leu Leu Leu Trp 
20 25 ■ 30 

Leu Glu Phe Pro Val" Glu Thr- Ser. Gly Val Cys Ser Xaa " 
35 40 45 



<210> 172 

<211> 47 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (47) 

<223> Xaa equals stop translation 

-<400>-172 : ■ 

Met lie Leu Lys His Tyr lie Leu Thr Phe lie Phe Leu Phe lie Phe 
1 5 10 15 



Leu Phe Phe Met Leu Asn lie Leu His Ser Asn Ser Asn Leu lie Asp 
20 25 30 

Leu Leu Lys Gly Asn lie Arg Phe Arg Leu Leu Asn Ser Met Xaa 
35 40- 45 



<210> 173 

<211> 42 

<212> PRT 

<213> Homo sapiens 
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<220> 

<221> SITE 
<222> (42) 

<22 3> Xaa equals stop translation 
<400> 173 

Met Ala Thr Leu Gin lie Thr Thr Ala Met Lys lie Thr Met Met lie 
1 5 10 15 

Thr Met Val Met lie lie Thr Thr He Val Glu Ala Met Lys He Pro 
20 25 30 

Thr Thr Ala Met Met Met Ala Met Gin Xaa 
35 40 



<210> 174 

<211> 47 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (47) 

<223> Xaa equals stop translation ~ 
<400> 174 

Met Glu Met Leu Ser Ser Lys Trp Ser Lys Arg Val Ala Ala Ser Leu 
1 5 10 15 

Ala His Leu He Ser Leu Phe He Gly Leu Leu Phe Leu Leu Leu Gly 
20 25 30 

Ser Ser Val Tyr Pro Gly Thr Glu Thr Leu Phe Pro Lys Ser Xaa 
35 40 45 



<210> 175 
<211> 41 

<212> PRT • - . - 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (41) 

<223> Xaa equals stop translation 
<400> 175 

Met Trp" Pro Ser Leu Gly^Arg Cys Cys Leu'Phe Phe ~ Cys Leu Leu Thr 
15 10 15 

Asn Leu Thr Ser Cys His Thr Ser Gin He Thr Leu Cys Ser Arg Glu 
20 25 30 

Thr Cys Val Trp Ser Arg Thr Thr Xaa 
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35 40 



<210> 176 
<211> 53 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (53) 

<223> Xaa equals stop translation 



<400> 176 
Met Tyr Leu Met 
1 

Cys Thr lie Leu 
20 

Ser Thr Pro Arg 
35 



Ser Phe Ser lie 
5 

Val Leu Ser Pro 



Pro Leu Trp Ser 
40 



His Phe Val Lys 
10 

Pro Val Leu Leu 
25 

Gin Cys Lys He 



He He Cys Met 
15 

Lys Tyr Gin Asp 
30 

Pro He Asn Tyr 
45 



Leu Lys Gly Lys Xaa 
50 



<210> 177 
<211> 250 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SITE 
<222> (250) 

<223> Xaa equals stop translation 
<400> 177 

Met Arg Gly Pro Ser Trp Ser Arg Pro Arg Pro Leu Leu Leu Leu Leu 

r _: 5 ro r5 

. Leu Leu Leu Ser Pro Trp Pro Val Trp Ala Gin Val Ser Ala Arg Ala 
20 25 30 

Ser Pro Ser Gly Ser Leu Gly Ala Pro Asp Cys Pro Glu Val Cys Thr 
35 40 45 

Cy s Val Pro G ly Gly Leu Pr o Ala Val Gly Thr Leu Ala Ala Arg Ar g 

50 55 60 

Ala Pro Gly Pro Glu Pro Ala Pro Ala Arg Ala Ala Ala Gly Pro Gin 
65 70 75 80 

Pro Arg Pro Cys Ala Ala Ala Arg Cys Leu Arg Gly Ser Gly Arg Ala 
85 90 95 
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Thr Ala Pro Gly Pro Ala Arg Glu Arg Ala Ala Leu Gly Ala Cys Ala 
100 105 110 

Ser Leu Leu Gly Pro Gly Arg Ala Ala Ala Ala Gly Pro Glu Arg Gin 
115 120 125 

Pro Ala Gly Ser Thr Gly Thr Arg Asp Phe Arg Ala Ala Ala Arg Ala 
130 135 140 

Ala Gin Pro Leu He Gly Arg Gin Pro Ala Gly Ala Pro Gly Ala Arg 
145 150 155 160 

Gly Ala Arg Arg Ala Pro Ala Ala Ala Leu Thr Gin Pro Ala Gly Gin 
165 170 175 

Arg Ala Gly Gly Thr Arg Ala Gly Ala Ala Gly Pro Pro Ala Arg Ser 
180 185 190 

Arg Arg Ala Ala Pro Ala Arg Gin Pro Leu Gly Leu Arg Val Arg Ala 
195 200 205 

Ala Pro Ala Leu Arg Leu Ala Ala Pro Ala Pro Ala Ala Arg Val Arg 
210 215 220 

Gly Arg Asp Gly Ala Leu- Arg -Val Ala Gly Thr Pro Asp Ala Gin Pro 
225 230 235 240 

Pro Asp Cys Leu Phe Arg Arg Arg Leu Xaa 
245 250 



<210> 178 
<211> 148 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
~<~222> (148)" " " 

<223> Xaa equals stop translation 



<400> 178 

Met Leu Ala Gly Ala Gly Arg Pro Gly Leu Pro Gin Gly Arg His Leu 
15 10 15 

Cys Trp Leu Leu Cys Ala Phe Thr Leu Lys Leu Cys Gin Ala Glu Ala 

20 1_25 30 



Pro Val Gin Glu Glu Lys Leu Ser Ala Ser Thr Ser Asn Leu Pro Cys 
35 40 45 

Trp Leu Val Glu Glu Phe Val Val Ala Glu Glu Cys Ser Pro Cys Ser 
50 55 60 
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Asn Phe Arg Ala Lys Thr Thr Pro 
65 70 

Glu Lys lie Thr Cys Ser Ser Ser 
85 

Arg Phe Ser Phe Glu Trp Asn Asn 
100 

Ala Val Val Cys Val Ala Leu lie 
115 120 

Gin Arg Gin Leu Asp Arg Lys Ala 
130 135 

Glu Ser lie Xaa 
145 



Glu Cys Gly Pro Thr Gly Tyr Val 
75 80 

Lys Arg Asn Glu Phe Lys Ser Cys 
90 95 

Ala Tyr Phe Gly Ser Ser Lys Gly 
105 110 

Phe Ala Cys Leu Val lie lie Arg 
125 

Leu Glu Lys Val Arg Lys Gin lie 
140 



<210> 179 

<211> 48 

<212> PRT 

<213> Homo sapiens 

<220> _ ' * 

<221> SITE - - - ~ 

<222> (48) 

<223> Xaa equals stop translation 
<400> 179 

Met Phe Met Cys Arg Leu Leu Leu Trp Ala Thr Gly Ala Tyr Gly Phe 
15 10 15 

Leu Gly Asp Asp Val Glu Tyr Thr Ser Val Leu Pro His Gin Lys Gly 
20 25 30 

Lys Glu Ala Trp Val Phe lie Cys Gin Leu Pro Phe lie lie Gly Xaa 
35 40 45 



<210> 180 

<211> 57 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> SITE 
<222> (56) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
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<222> (57) 

<223> Xaa equals stop translation 
<400> 180 

Met Leu Gin Thr Leu Leu Cys Leu Trp Gin Tyr Thr Ser Ala Gin Val 
1 5 10 15 

Leu Lys Met Leu Cys He His Arg Gin Lys Trp Asp Asn Phe Trp Ala 
20 25 30 

Val Val Met He Asn Leu Leu He Arg lie Gin Arg Leu Pro Phe Ser 
35 40 45 

Leu Pro He Ala Leu Arg Val Xaa Xaa 
50 55 



<210> 181 
<211> 49 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 

<222> (49) _ 

<223> Xaa equals stop translation 

<400> 181 

Met Pro Ser Glu Gly Arg Leu Val Leu Leu Ser Ala Phe Cys Pro Ala 
1 5 10 15 

Phe Phe Pro Pro Trp Val Leu Ser Gly Ser Phe Ala Phe Ser Leu Cys 
20 25 30 

Ala Glu Ser His Leu Asn Ser Ser His Arg Arg He Ala Val Trp Thr 
35 40 45 

Xaa 



<210> 182 

<211> 46 

<212> PRT 

<213> Homo sapiens 

<220> 

_<221>_SITE : 

<222> (46) 

<223> Xaa equals stop translation 
<400> 182 

Met Val Gin Trp Lys Asn Trp Pro Glu Ser Leu Glu Val Trp Val Leu 
15 10 15 
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Val Leu Ala Val Pro Leu Thr His Cys Asp Leu Gly lie Leu Cys Cys 
20 25 30 

Glu Asp He Ser Gin Val Leu His Val Ser Gin Gin He Xaa 
35 40 45 



<210> 183 
<211> 41 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (41) 

<223> Xaa equals stop translation 
<400> 183 

Met Ala Leu Gly Leu Cys Ser Ser Gly Ala Leu Ser Thr Leu Cys Leu 
1 5 10 15 

Ser Ser Val Thr Cys Leu Ala He Met Val Leu Met Ala Val Asp Gly 
20 25 30 

Leu His Gly Thr Ser Gly Leu Gly Xaa 

35 "** - 40~ 



<210> 184 
<211> 80 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (80) 

<223> Xaa equals stop translation 
<400> 184 _ 

Met Thr Leu Met Cys Leu Cys Leu Ser Val Thr Val Leu His ProLeu 
1 5 10 15 

Arg Ser Lys Glu Arg Leu Ser Gly Thr Phe Cys Gly Tyr Ser Ser Ser 
20 25 30 

Trp Cys Ser Pro Ala Ser Glu Ser Ser Ser Pro Gly Ser Leu Leu Thr 
35 40 45 



Cys Ala Ala Ser Gly Ser His Pro Asp Cys Pro Leu Ser Gin Arg Leu 

50 55 60 

Leu Gly Val Gin Leu Ala Ala Leu Gly Arg Pro Gin Gly Leu Phe Xaa 

65 70 75 80 
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<210> 185 
<211> 47 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (47) 

<223> Xaa equals stop translation 
<400> 185 

Met Lys Ser Gin Cys Tyr Ser Pro Ser Tyr Phe Ala Phe Phe Cys Leu 
1 5 10 15 

Val Phe Phe Gin He Thr Ser Ala Ser Ser Gin Thr Leu Arg Gly His 
20 25 30 

Val Leu Cys Arg Thr Thr Leu Arg Asp Ser Ser Ala Tyr Cys Xaa 
35 40 45 



<210> 186 
<211> 141 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (36) 

<223> Xaa equals^ any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (141) 

<223> Xaa equals stop translation 
<400> 186 . 

Met Phe Leu Phe Gly Gly Phe Leu Met Thr Leu Phe Gly Leu Phe Val 
15 10 15 

Ser Leu Val Phe Leu Gly Gin Ala Phe Thr He Met Leu Val Tyr Val 
20 25 30 

Trp Ser Arg Xaa Asn Pro Tyr Val Arg Met Asn Phe Phe Gly Leu Leu 

35 40- 45 

Asn Phe Gin Ala Pro Phe Leu Pro Trp Val Leu Met Gly Phe Ser Leu 
50 55 60 



Leu Leu Gly Asn Ser He He Val Asp Leu Leu Gly He Ala Val Gly 
65 70 75 80 
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His lie Tyr Phe Phe Leu Glu Asp Val Phe Pro Asn Gin Pro Gly Gly 
85 90 95 

He Arg He Leu Lys Thr Pro Ser He Leu Lys Ala He Phe Asp Thr 
100 105 HO 

Pro Asp Glu Asp Pro Asn Tyr Asn Pro Leu Pro Glu Glu Arg Pro Gly 
115 120 125 

Gly Phe Ala Trp Gly Glu Gly Gin Arg Leu Gly Gly Xaa 
130 135 140 



<210> 187 
<211> 339 
<212> PRT 
<213>' Homo sapiens 

<220> 

<221> SITE 
<222> (339) 

<223> Xaa equals stop translation 



<400> 187 

Met Arg Lys Pro Ala Ala Gly Phe .Leu. Pro Ser Leu Leu .Lys Val Leu 
1 5 • 10 15 

Leu Leu Pro Leu Ala Pro Ala Ala Ala Gin Asp Ser Thr Gin Ala Ser 
20 25 30 

Thr Pro Gly Ser Pro Leu Ser Pro Thr Glu Tyr Glu Arg Phe Phe Ala 
35 40 45 

Leu Leu Thr Pro Thr Trp Lys Ala Glu Thr Thr Cys Arg Leu Arg Ala 
50 55 60 

Thr His Gly Cys Arg Asn Pro Thr Leu Val Gin Leu Asp Gin Tyr Glu 
65 70 75 80 



Asn His Gly Leu Val Pro Asp Gly Ala Val Cys Ser Asn Leu Pro Tyr 
85 90 95 

Ala Ser Trp Phe Glu Ser Phe Cys Gin Phe Thr His Tyr Arg Cys Ser 
100 105 110 

Asn His Val Tyr Tyr Ala Lys Arg Val Leu Cys Ser Gin Pro Val Ser 
115 120 125 



He Leu Ser Pro Asn Thr Leu Lys Glu He Glu Ala Ser Ala Glu Val 
130 135 140 



Ser Pro Thr Thr Met Thr Ser Pro He Ser Pro His Phe Thr Val Thr 
145 150 155 160 

Glu Arg Gin Thr Phe Gin Pro Trp Pro Glu Arg Leu Ser Asn Asn Val 
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165 170 175 

Glu Glu Leu Leu Gin Ser Ser Leu Ser Leu Gly Ser Gin Glu Gin Ala 
180 185 190 

Pro Glu His Lys Gin Glu Gin Gly Val Glu His Arg Gin Glu Pro Thr 
195 200 205 

Gin Glu His Lys Gin Glu Glu Gly Gin Lys Gin Glu Glu Gin Glu Glu 
210 215 220 

Glu Gin Glu Glu Glu Gly Lys Gin Glu Glu Gly Gin Gly Thr Lys Glu 
225 230 235 240 

Gly Arg Glu Ala Val Ser Gin Leu Gin Thr Asp Ser Glu Pro Lys Phe 
245 250 255 

His Ser Glu Ser Leu Ser Ser Asn Pro Ser Ser Phe Ala Pro Arg Val 
260 265 270 

Arg Glu Val Glu Ser Thr Pro Met lie Met Glu Asn lie Gin Glu Leu 
275 280 285 

lie Arg Ser Ala Gin Glu lie Asp Glu Met Asn Glu lie Tyr Asp Glu 

290 _ .295 * . - _ . 300 

Asn Ser Tyr Trp Arg Asn Gin Asn Pro Gly Ser Leu Leu Gin Leu Pro 
305 310 315 320 

His Thr Glu Pro Cys Trp Cys Cys Ala lie Arg Ser Trp Arg lie Pro 
325 330 335 

Ala Ser Xaa 



<210> 188 
<211> 66 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (66) 

<223> Xaa equals stop translation 
<400> 188 

Met^ln_Arg_Il.e„Pro_Th^ 

1 5 10 15 

Trp Ala Met Phe Gin Gly Pro Ala Ala Gly Ser Val Gly Ala Glu Arg 
20 25 30 

Lys Gly Glu Gly Cys Leu Phe Phe Gly Gin Asp Glu Ser Ser Arg Cys 
35 40 45 
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Gly Arg Ser Trp Pro Leu Ala Asp Pro Trp Val Tyr Arg Val Leu Arg 
50 55 60 

Ser Xaa 
65 



<210> 189 
<211> 360 
<212> PRT 
<213> Homo sapiens 



<400> 189 

Met Val Pro Ala Ala Gly Arg Arg Pro Pro Arg Val Met Arg Leu Leu 
1 "5 10 15 

Gly Trp Trp Gin Val Leu Leu Trp Val Leu Gly Leu Pro Val Arg Gly 
20 25 30. 

Val Glu Val Ala Glu Glu Ser Gly Arg Leu Trp Ser Glu Glu Gin Pro 
35 40 45 

Ala His Pro Leu Gin Val Gly Ala Val Tyr Leu Gly Glu Glu Glu Leu 

50 _ 55 ' ' ; - _ 60 

Leu His Asp Pro Met Gly Gin Asp Arg Ala Ala Glu Glu Ala Asn Ala 
65 70 75 80 

Val Leu Gly Leu Asp Thr Gin Gly Asp His Met Val Met Leu Ser Val 
85 90 95 

lie Pro Gly Glu Ala Glu Asp Lys Val Ser Ser Glu Pro Ser Gly Val 
100 105 110 

Thr Cys Gly Ala Gly Gly Ala Glu Asp Ser Arg Cys Asn Val Arg Glu 
115 120 ' 125 

Ser Leu Phe Ser Leu Asp Gly Ala Gly Ala His Phe Pro Asp Arg Glu 
r3~0" 135 - 140" ~ 



Glu Glu Tyr Tyr Thr Glu Pro Glu Val Ala Glu Ser Asp Ala Ala Pro 
145 150 155 160 

Thr Glu Asp Ser Asn Asn Thr Glu Ser Leu Lys Ser Pro Lys Val Asn 
165 170 175 

C ys Glu Glu Arg Asn lie Thr Gly -Leu Glu A sn Phe Thr Leu Lys lie 
180 185 190 

Leu Asn Met Ser Gin Asp Leu Met Asp Phe Leu Asn Pro Asn Gly Ser 
' 195 200 205 

Asp Cys Thr Leu Val Leu Phe Tyr Thr Pro Trp Cys Arg Phe Ser Ala 
210 215 220 
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Ser Leu Ala Pro 
225 

His Phe Leu Ala 



Phe Gly Thr Val 
260 

Pro Met Ala Arg 
275 

lie Phe lie Phe 
290 

Val Thr Gin Ala 
305 

Ser Val Asp Trp 



lie Met Tyr Ala 
340 

Gly Gin Glu Gin 

355 



His Phe Asn Ser 
230 

Leu Asp Ala Ser 
245 

Ala Val Pro Asn 



Phe Asn His Thr 
280 

Asn Gin Thr Gly 
295 

Asp Gin lie Gly 
310 

Leu Leu Val Phe 
325 

Thr lie Arg Thr 



Glu His Val Glu 
360 



Leu Pro Arg Ala 
.235 

Gin His Ser Ser 
250 

lie Leu Leu Phe 
265 

Asp Arg Thr Leu 



lie Glu Ala Lys 
300 

Pro Leu Pro Ser 
315 

Ser Leu Phe Phe 
330 

Glu Ser lie Arg 
345 



Phe Pro Ala Leu 
240 

Leu Ser Thr Arg 
255 

Gin Gly Ala Lys 
270 

Glu Thr Leu Lys 
285 

Lys Asn Val Val 



Thr Leu lie Lys 
320 

Leu lie Ser Phe 
335 

Trp Leu lie Pro 
350 



<210> 190 
<211>- 160 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (160) 

<223> Xaa equals stop translation 

<400> 190 ~ ~ 

Met Leu Leu Leu Leu lie Phe Trp lie Ala Pro Ala His Gly Pro Thr 
15 10 15 

Asn lie Met Val Tyr lie Ser lie Cys Ser Leu Leu Gly Ser Phe Thr 
20 25 30 

Val Pro Ser Thr Lys Gly lie Gly Leu Ala Ala Gin Asp lie Leu His 

35 4 45 - - 

Asn Asn Pro Ser Ser Gin Arg Ala Leu Cys Leu Cys Leu Val Leu Leu 
50 55 60 



Ala Val Leu Gly Cys Ser lie lie Val Gin Phe Arg Tyr lie Asn Lys 
65 70 75 80 
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Ala Leu Glu Cys Phe Asp Ser Ser 
85 

Val Phe Thr Thr Leu Val Leu Leu 
100 

Trp Ser Asn Val Gly Leu Val Asp 
115 120 

Thr Thr Val Ser Val Gly lie Val 
130 135 

Asn Phe Asn Leu Gly Glu Met Asn 
145 150 



Val Phe Gly Ala lie Tyr Tyr Val 
90 95 

Ala Ser Ala lie Leu Phe Arg Glu 
105 HO 

Phe Leu Gly Met Ala Cys Gly Phe 
125 

Leu lie Gin Val Phe Lys Glu Phe 
140 

Lys Ser Asn Met Lys Thr Asp Xaa 
155 160 



<210> 191 
<211> 101 
<212> PRT 
<213> Homo sapiens 

<220> _ ' ; _ . . ' - 

<221> SITE " 
<222> (92) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (96) 

<22 3> Xaa equals any of the naturally occurring L-amino acids 



<220> 

<221> SITE 
<222> (101) 

<223> Xaa equals stop translation 
<400>" 191 

Met Phe Val Ala Val Phe Tyr Trp Val Leu Thr Val Phe Phe Leu lie 
1 5 10 15 

lie Tyr lie Thr Met Thr Tyr Thr Arg lie Pro Gin Val Pro Trp Thr 
20 25 30 

Thr Val Gly Leu Cys Phe Asn Gly Ser Ala Phe Val Leu Tyr Leu Ser 

33 : 40^ 45 

Ala Ala Val Val Asp Ala Ser Ser Val Ser Pro Glu Lys Asp Ser His 
50 55 60 

Asn Phe Asn Ser Trp Ala Ala Ser Ser Phe Phe Ala Phe Leu Val Thr 
65 70 75 80 
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lie Cys Tyr Ala Gly Asn Thr Tyr Phe Ser Phe Xaa Ala Trp Arg Xaa 
85 90 95 

Arg Thr lie Gin Xaa 
100 



<210> 192 
<211> 43 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (43) 

<223> Xaa equals stop translation 
<400> 192 

Met Phe Lys Leu Gin Leu Asp Leu Leu Thr Ala Val Asn Leu Val Tyr 
1 5 10 15 

Phe Ser Phe Leu Trp Val Val Ser Val Ala Asn Lys Met Asp Val Ser 
20 25 30 

Val Phe Glu Leu Val_ Asn . Ser Asp - Cys Phe Xaa 
35 - -40" 



<210> 193 
<211> 62 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SITE 
<222> (621 

<223> Xaa equals stop translation 



<400> 193 

Met Ser Val Cys Val Phe Leu Asp Phe Arg Leu lie Phe Trp Ser Phe 

1 5 10 15 

Cys Pro Cys Ser Ala Ser Pro Ser Arg His Phe Ala Ser Ser Ser Arg 

20 25 30 

Gly Gly Gly Gly Gly Ser Arg Asn Trp Val Gly Ala Gly Ala Ser Leu 

35 40 45 



Ala Ala Ser Leu Ala Leu Tyr Ala Leu Ser Pro Arg Arg Xaa 
50 55 60 



<210> 194 
<211> 53 
<212> PRT 
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<213> Homo sapiens 



<220> 

<221> SITE 
<222> (53) 

<223> Xaa eqfuals stop translation 



<400> 194 

Met Gin Ala Gin lie Ser Ser Pro Arg Trp Thr Ser Trp Phe Ser Leu 
1 5 10 15 

Thr Ala Val Thr Leu Ala Phe Pro Ser Leu lie Pro Tyr Pro Ser Cys 
20 25 30 

Gly lie Pro Val Leu Thr Gin Asp Ala Lys Trp Pro Ser Asp Tyr Thr 
35 40 45 

Ser Pro Asp Ser Xaa 
50 



<210> 195 
<211> 186 
<212> PRT 

<213> Homo sapiens " . 

<220> 

<221> SITE 
<222> (114) 

<223> Xaa equals any of the naturally occurring L-amino acids 



<220> 

<221> SITE 
<222> (186) 

<223> Xaa ecjuals stop translation 



<400> 195 

Met Thr Leu Leu Asn Leu Leu Leu 
1 5 



Cys Leu Asp Asp Val Leu Lys Arg 
20 



Gin Thr lie Phe Tyr Gly Val Thr 
10 15 



Thr Lys Gly Gly Lys Asp lie Lys 
25 30 



Phe Leu Thr Ala Phe Arg Asp Leu Leu Phe Thr Thr Leu Ala Phe Pro 
35 40 45 

Val Ser Thr Phe Val Phe Leu Ala Phe Trp lie Leu Phe Leu Tyr Asn 

50 5JL__J 60 



Arg Asp Leu lie Tyr Pro Lys Val Leu Asp Thr Val lie Pro Val Trp 

65 70 75 80 

Leu Asn His Ala Met His Thr Phe lie Phe Pro lie Thr Leu Ala Glu 

85 90 95 
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Val Val Leu Arg Pro His Ser Tyr 
100 

Leu Xaa Ala Ala Ala Ser lie Ala 

115 120 

Tyr Phe Glu Thr Gly Thr Trp Val 
130 135 

Leu Leu Gly Leu Ala Ala Phe Phe 
145 150 

Ser lie Tyr Leu Leu Gly Glu Lys 
165 

Met Arg Gin Pro Arg Lys Lys Arg 
180 



Pro Ser Lys Lys Thr Gly Leu Thr 
105 HO 

Tyr lie Ser Arg lie Leu Trp Leu 
125 

Tyr Pro Val Phe Ala Lys Leu Ser 
140 

Ser Leu Ser Tyr Val Phe lie Ala 

155 160 

Leu Asn His Trp Lys Trp Gly Asp 
170 175 

Lys Xaa 
185 



<210> 196 

<211> 77 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE - ."*."." - ' 

<222> (77) 

<223> Xaa equals stop translation 

<400> 196 

Met Lys Asn Ala Thr Leu Leu Arg Met Val Leu Phe Val lie Asn Leu 
1 5 10 15 

Gin Asn Leu Lys Ser Cys Pro Val Leu His lie His Gin Asp Val Gin 
20 25 30 

Gin Gin Lys Arg Met Gly His Gly Gly Ser Ser Thr Arg Val Thr Val 
35 40 45 



Thr Ser Leu lie Arg His Cys Thr Val Phe Gin Arg Pro Lys Asn Cys 
50 55 60 

Val' Gin Asn Met lie Thr Leu Gin Leu Ser Phe Pro Xaa 
65 70 75 



<210> 197 

<211> 175 _ :_ 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (175) 

<223> Xaa equals stop translation 
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<400> 197 

Met Phe Val Pro Ser Cys Leu Cys Leu Arg Phe Val Val Thr Ser Leu 
15 10 15 

Leu Leu Gin Met Thr His Ser Cys Gly Gly Phe Tyr lie Cys Val lie 
20 25 30 

Phe Glu Thr lie Leu Ser Glu Phe Lys Thr Gin He Gly Arg Leu Tyr 
35 40 45 

Arg Lys Arg His He Gin Arg Lys Glu Ser Pro Lys Gly Arg Phe Val 
50 55 60 

Met Leu Leu Pro Ser Ser Thr His Thr He Pro Phe Tyr Pro Asn Pro 
65 70 75 80 

Leu His Pro Arg Pro Phe Pro Ser Ser Arg Leu Pro Pro Gly He He 
85 90 95 

Gly Gly Glu Tyr Asp Gin Arg Pro Thr Leu Pro Tyr Val Gly Asp Pro 
100 105 110 

He Ser Ser Leu He Pro Gly Pro Gly Glu Thr Pro Ser Gin Phe Pro 
115 _ . . 120. 125. 

Pro Leu Arg Pro Arg Phe Asp Pro Val Gly Pro Leu Pro Gly Pro Asn 
130 135 140 

Pro He Leu Pro Gly Arg Gly Gly Pro Asn Asp Arg Phe Pro Phe Arg 
145 150 155 160 

Pro Ser Arg Gly Arg Pro Thr Asp Gly Arg Leu Ser Phe Met Xaa 
165 170 175 



<210> 198 
<211> 51 
<212> PRT 



<213> Homo sapiens 
<220> 

<221> SITE 

<222> (51) 

<223> Xaa equals stop translation 

<400> 198 

^eJt_Gly„LejLi„Lys_Arg_Lys„Gl^ 

15 10 15 

Lys Ser Thr Val Ala Ser Trp Leu Leu Ser Gly Val Gly Arg He Trp 
20 25 30 

Gly Leu Val His Phe Val Lys Val Asn His Val Cys Leu Asn Asn Arg 
35 40 45 
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Gly Val Xaa 
50 



<210> 199 
<211> 190 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (190) 

<223> Xaa equals stop translation 
<400> 199 

Met Gly Pro Val Arg Leu Gly lie Leu Leu Phe Leu Phe Leu Ala Val 
1 5 10 15 

His Glu Ala Trp Ala Gly Met Leu Lys Glu Glu Asp Asp Asp Thr Glu 
20 25 30 

Arg Leu Pro Ser Lys Cys Glu Val Cys Lys Leu Leu Ser Thr Glu Leu 
35 40 45 

Gin Ala Glu Leu Ser Arg Thr* Gly Arg Ser Arg Glu Val Leu Glu Leu 
50 55 60 

Gly Gin Val Leu Asp Thr Gly Lys Arg Lys Arg His Val Pro Tyr Ser 
65 70 75 80 

Val Ser Glu Thr Arg Leu Glu Glu Ala Leu Glu Asn Leu Cys Glu Arg 
85 90 95 

lie Leu Asp Tyr Ser Val His Ala Glu Arg Lys Gly Ser Leu Arg Tyr 
100 105 110 

Ala Lys Gly Gin Ser Gin Thr Met Ala Thr Leu Lys Gly Leu Val Gin 

115 120 125 

Lys Gly Val Lys Val Asp Leu Gly lie Pro Leu Glu Leu Trp Asp Glu 
130 135 140 

Pro Ser Val Glu Val Thr Tyr Leu Lys Lys Gin Cys Glu Thr Met Leu 
145 150 155 160 

Glu Glu Glu Glu Glu Glu Glu Glu Glu Glu Gly Gly Asp Lys Met Thr 

1-65 : 170 — 1-7-5-= 



Lys Thr Gly Ser His Pro Lys Leu Asp Arg Glu Asp Leu Xaa 
180 185 190 



<210> 200 
<211> 80 
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<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 

<222> (80) 

<22 3> Xaa equals stop translation 

<400> 200 

Met Asn Tyr Ser Arg Ser Pro Trp Ala Ala Val Met Glu Pro Leu Thr 
1 5 10 15 

Leu Leu Phe Leu His Leu Ser Cys Leu Leu Ser Leu Cys Glu Ala Val 
20 25 30 

Gly Trp Asp Ser Glu Cys Leu Val Cys Ser Leu Gly Glu Glu Glu Phe 
35 40 45 

Leu Arg Met Gin Ala Leu Leu Cys Gly. Cys Arg Leu His Leu Gly Gly 
50 55 60 

Val Leu Tyr Val Cys Thr Leu Gly Thr Ala Cys lie Trp Lys lie Xaa 
65 70 75 80 



<210> 201 

<211> 106 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (106) 

<223> Xaa equals stop translation 

<400> 201 

Met Asn Leu Gly Val Ser Met Leu Arg lie Leu Phe Leu Leu Asp Val 
15 10 15 

Gly Gly Ala Gin Val Leu Ala Thr Gly Lys Thr Pro Gly Ala Glu He 
20 25 30 

Asp Phe Lys Tyr Ala Leu He Gly Thr Ala Val Gly Val Ala He Ser 
35 40 45 



Ala Gly Phe Leu Ala Leu Lys He Cys Met He Arg Arg His Leu Phe 
50 55 60 

Asp Asp Asp Ser Ser Asp Leu Lys Ser Thr Pro Gly Gly Leu Ser Asp 

65 70 75 80 

Thr He Pro Leu Lys Lys Arg Ala Pro Arg Arg Asn His Asn Phe Ser 
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85 90 95 



Lys Arg Asp Ala Gin Val lie Glu Leu Xaa 
100 105 



<210> 202 
<211> 80 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (80) 

<223> Xaa equals stop translation 
<400> 202 

Met Ala Cys Leu Gly Gly Leu Leu Gly lie lie Gly Val lie Cys Leu 
1 5 10 15 

lie Ser Cys Leu Ser Pro Glu Met Asn Cys Asp Gly Gly His Ser Tyr 
20 25 30 

Val Arg Asn Tyr Leu Gin Lys Pro Thr Phe Ala Leu Gly Glu Leu Tyr 

35 — '..40. - _ . . • '45. 

Pro Pro Leu lie Asn Leu Trp Glu Ala - Gly Lys Glu Lys Ser Thr Ser 
50 55 60 

Leu Lys Val Lys Ala Thr Val lie Gly Leu Pro Thr Asn Met Ser Xaa 
65 70 75 80 



<210> 203 
<211> 58 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (58) 

<223> Xaa equals stop translation 
<400> 203 

- Me t-Gly- Leu-Lys— Leu-Leu -Gln-Lys— Pro -Gly— Ser— Leu-Lys -Thr- Leu -I-l-e- 
.15 10 15 

Ala lie lie Leu Val Met Tyr lie Phe Met Thr lie Ser Val lie Ala 
20 25 30 

Gly Thr Gly Lys Phe Ser Gin Lys Leu Asp Leu His Leu Asn Met Asp 
35 40 45 



Ducrwirv ^un OMTe^nAi i „ 
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He Ser Pro Gly Arg Pro Ser Val His Xaa 
50 55 



<210> 204 

<211> 161 

<212> PRT 

<213> Homo sapiens 

<400> 204 

Met Asp Phe Leu Asn Pro Asn Gly Ser Asp Cys Thr Leu Val Leu Phe 
1 5 10 15 

Tyr Thr Pro Trp Cys Arg Phe Ser Ala Ser Leu Ala Pro His Phe Asn 
20 25 30 

Ser Leu Pro Arg Ala Phe Pro Ala Leu His Phe Leu Ala Leu Asp Ala 
35 40 45 

Ser Gin His Ser Ser Leu Ser Thr Arg Phe Gly Thr Val Ala Val Pro 
50 55 60 

Asn He Leu Leu Phe Gin Gly Ala Lys Pro Met Ala Arg Phe Asn His 

65 _ 70 "...* 75 " 80 

.Thr Asp Arg Thr Leu Glu Thr Leu Lys lie Phe He Phe Asn Gin Thr 
85 90 95 

Gly He Glu Ala Lys Lys Asn Val Val Val Thr Gin Ala Asp Gin He 
100 105 110 

Gly Pro Leu Pro Ser Thr Leu He Lys Ser Val Asp Trp Leu Leu Val 
115 120 125 

Phe Ser Leu Phe Phe Leu He Ser Phe He Met Tyr Ala Thr He Arg 
130 135 140 

Thr Glu Ser He Arg_Trp Leu lie Pro Gly Gin Glu Gin Glu His Val 
145" 150 155 " 160 

Glu 



<210> 205 
<211> 137 
-<212^PRT_ 



<213> Homo sapiens 
<220> 

<221> SITE 
<222> (10) 

<223> Xaa equals any of the naturally occurring L-amino acids 
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<400> 205 
lie Pro Glu Asn 
1 

Thr Ser Arg Thr 
20 

Val Ser Ser Ala 
35 

Ser Thr Ser Cys 
50 

Cys Thr Pro Ser 
65 

Leu Glu Leu Pro 



Thr Tyr Arg Cys 
100 

Ala" Tyr Glu Met 
115 

Lys Phe Leu Leu 
130 



Arg Arg Pro Ala 
5 

Thr Thr Arg Arg 



Ser Val Ser Ser 
40 

Cys Arg Ser Ser 
55 

Ala Ser Thr Glu 
70 

Val Val His Thr 
85 

Ser Ala Gly Asp 



Gly Glu Glu Met 
120 

Phe His Phe Tyr 
135 



Ser Xaa Cys Thr 
10 

Pro Pro Trp Gly 
25 

Thr Arg Lys Thr 



Arg Arg Arg Val 
60 

Pro Ser Ala Arg 
75 

Phe Ser Phe Leu 
90 

Gly Ser lie Thr 
105 

Pro Lys Arg Gin 



Leu 



Trp Ser Met Trp 

15 " 

Arg Phe Ser Ser 
30 

Trp Arg Thr Arg 
45 

Ala Ala Pro Phe 



Met Glu Pro Pro 
80 

Thr Phe Val Phe 
95 

Gin lie Asn Cys 
110 

Met Lys Ala lie 
125 



<210> 206 

<211> 41. 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> SITE 
<222> (10) 

<223> Xaa equals any of the naturally occurring L-amino acids 
"<400> 206 ~ 

lie Pro Glu Asn Arg Arg Pro Ala Ser Xaa Cys Thr Trp Ser Met Trp 
15 10 15 

Thr Ser Arg Thr Thr Thr Arg Arg Pro Pro Trp Gly Arg Phe Ser Ser 
20 25 30 

Val Ser Ser Ala Ser Val Ser Ser Thr 

3JL L 4 0_i 



<210> 207 

<211> 43 

<212> PRT 

<213> Homo sapiens 
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<400> 207 
Arg Lys Thr Trp 
1 

Arg Arg Val Ala 
20 

Ser Ala Arg Met 
35 



Arg Thr Arg Ser 
5 

Ala Pro Phe Cys 

Glu Pro Pro Leu 
40 



Thr Ser Cys Cys 
10 

Thr Pro Ser Ala 
25 

Glu Leu Pro 



Arg Ser Ser Arg 
15 

Ser Thr Glu Pro 
30 



<210> 208 
<211> 53 
<212> PRT 

<213> Homo sapiens 
<400> 208 

Val Val His Thr Phe Ser Phe Leu Thr Phe -Val Phe Thr Tyr Arg Cys 
15 10 15 

Ser Ala Gly Asp Gly Ser lie Thr Gin lie Asn Cys Ala Tyr Glu Met 
20 25 30 

Gly Glu Glu Met Pro Lys Arg Gin Met Lys Ala lie Lys Phe Leu Leu 

35 40 „ " _. 45 - 

Phe His Phe Tyr Leu 
50 



<210> 209 
<211> 223 
<212> PRT 

<213> Homo sapiens 
<400> 209 

His Pro Ser lie lie He Trp Ser Gly Asn Asn Glu Asn Glu Glu Ala 
15 10 15 

Leu Met Met Asn Trp Tyr His He Ser Phe Thr Asp Arg Pro He Tyr 
20 25 30 

He Lys Asp Tyr Val Thr Leu Tyr Val Lys Asn He Arg Glu Leu Val 
35 40 45 

Leu Ala Gly Asp Lys Ser Arg Pro Phe He Thr Ser Ser Pro Thr Asn 
50 55 60 



Gly Ala Glu Thr Val Ala Glu Ala Trp Val Ser Gin Asn Pro Asn Ser 

65 70 75 80 

Asn Tyr Phe Gly Asp Val His Phe Tyr Asp Tyr He Ser Asp Cys Trp 

85 90 95 

Asn Trp Lys Val Phe Pro Lys Ala Arg Phe Ala Ser Glu Tyr Gly Tyr 
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100 105 110 

Gin Ser Trp Pro Ser Phe Ser Thr Leu Glu Lys Val Ser Ser Thr Glu 
115 120 125 

Asp Trp Ser Phe Asn Ser Lys Phe Ser Leu His Arg Gin His His Glu 
130 135 140 

Gly Gly Asn Lys Gin Met Leu Tyr Gin Ala Gly Leu His Phe Lys Leu 
145 150 155 160 

Pro Gin Ser Thr Asp Pro Leu Arg Thr Phe Lys Asp Thr lie Tyr Leu 
165 170 175 

Thr Gin Val Met Gin Ala Gin Cys Val Lys Thr Glu Thr Glu Phe Tyr 
180 185 190 

Arg Arg Ser Arg Ser Glu lie Val Asp Gin Gin Gly His Thr Met Gly 
195 200 205 

Ala Leu Tyr Trp Gin Leu Asn Asp lie Trp Gin Ala Pro Ser Trp 
210 215 220 



<210> 210 _ . - 

<211> 160 

<212> PRT 

<213> Homo sapiens 

<400> 210 

Val Arg Val His Thr Trp Ser Ser Leu Glu Pro Val Cys Ser Arg Val 
15 10 15 

Thr Glu Arg Phe Val Met Lys Gly Gly Glu Ala Val Cys Leu Tyr Glu 
20 25 30 

Glu Pro Val Ser Glu Leu Leu Arg Arg Cys Gly Asn Cys Thr Arg Glu 
35 40 45 

Ser Cys Val Val Ser Phe Tyr Leu Ser Ala Asp His Glu Leu Leu Ser 
50 55 60 

Pro Thr Asn Tyr His Phe Leu Ser Ser Pro Lys Glu Ala Val Gly Leu 
65 70 75 80 

Cys Lys Ala Gin lie Thr Ala lie lie Ser Gin Gin Gly Asp lie Phe 
85 90 , 95 



Val Phe Asp Leu Glu Thr Ser Ala Val Ala Pro Phe Val Trp Leu Asp 
100 105 110 

Val Gly Ser lie Pro Gly Arg Phe Ser Asp Asn Gly Phe Leu Met Thr 
115 120 125 

Glu Lys Thr Arg Thr lie Leu Phe Tyr Pro Trp Glu Pro Thr Ser Lys 
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130 135 140 

Asn Glu Leu Glu Gin Ser Phe His Val Thr Ser Leu Thr Asp lie Tyr 
145 150 155 160 



<210> 211 
<211> 171 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (102) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 211 

Pro Arg Leu Thr Pro Arg Met Lys Trp Pro Thr Ala Ala Leu Ala Ser 
15 10 15 

Arg Leu Leu Gly Trp Thr Val Leu Arg Pro Pro Tyr Pro Arg Val Pro 

20 _ - ?5 . 30 

Ser Leu Pro Gin Val Thr Leu His Pro Thr Asp Gly Leu Met Ala Val 
35 40 45 

Leu Tyr Thr Gly Gly Glu Gly Arg Thr Leu Gly Glu Gin His Phe Phe 
50 55 60 

His Glu Thr Phe Val Thr Arg Trp Leu Leu Gly Pro Val Pro Val Arg 
65 70 75 80 

Phe Gly Ala Cys Ser Pro Leu Ser Phe Leu Ala Pro Arg Arg Gly Gin 
85 90 95 

Gly Ala Pro Ala Gly Xaa Phe Cys Ala Cys Pro Arg Pro Ala Ser Arg_ 
100 . 105 "HO 

Gin Leu Cys Pro Trp Pro Ala Leu Pro Gly Thr Pro Tyr Ser Asn Ser 
115 120 125 

Ala Pro Leu Cys Thr Gly Met Gly His Ser Asn Thr Pro Gin Gly Pro 
130 135 140 

-Pro_.Ser_Pro_Gln-Tyr-Ala-Leu-Ser-Pro-Thr-Glu_Pro~Thr-Ser-Leu-Ser- 
145 150 155 160 

Gly Asn Ser His Leu Pro Ala lie Leu Val Leu 
165 170 



<210> 212 
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<211> 41 

<212> PRT 

<213> Homo sapiens 

<400> 212 

Pro Arg Leu Thr Pro Arg Met Lys Trp Pro Thr Ala Ala Leu Ala Ser 
1 5 10 15 

Arg Leu Leu Gly Trp Thr Val Leu Arg Pro Pro Tyr Pro Arg Val Pro 
20 25 30 

Ser Leu Pro Gin Val Thr Leu His Pro 
35 40 



<210> 213 

<211> 41 

<212> PRT 

<213> Homo sapiens 

<400> 213 

Thr Asp Gly Leu Met Ala Val Leu Tyr Thr Gly Gly Glu Gly Arg Thr 
1 5 10 15 

Leu Gly Glu Gin His Phe Phe "His -Glu Thr Phe Val Thr Arg Trp Leu 

20 " " 25 • "' 30 

. Leu Gly Pro Val Pro Val Arg Phe Gly 
35 40 



<210> 214 

<211> 42 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 

<222> (20) _ 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 214 

Ala Cys Ser Pro Leu Ser Phe Leu Ala Pro Arg Arg Gly Gin Gly Ala 
15 10 15 

Pro Ala Gly Xaa Phe Cys Ala Cys Pro Arg Pro Ala Ser Arg Gin Leu 
20 25 30 



Cys Pro Trp Pro Ala Leu Pro Gly Thr Pro 
35 40 



<210> 215 
<211> 47 
<212> PRT 
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<213> Homo sapiens 
<400> 215 

Tyr Ser Asn Ser Ala Pro Leu Cys Thr Gly Met Gly His Ser Asn Thr 
15 10 15 

Pro Gin Gly Pro Pro Ser Pro Gin Tyr Ala Leu Ser Pro Thr Glu Pro 
20 25 30 

Thr Ser Leu Ser Gly Asn Ser His Leu Pro Ala lie Leu Val Leu 
35 40 45 



<210> 216 

<211> 27 

<212> PRT 

<213> Homo sapiens 

<400> 216 

His Leu Leu Glu Val Thr Pro Cys Arg Leu Pro Val Pro Glu Phe Pro 
1 5 10 15 

Gly Arg Thr Pro Arg Gly Ser Arg Thr Pro Asp 
20 25 



<210> 217 
<211> 239 
<212> PRT 
<213> Homo sapiens 

<400> 217 

Met lie Pro Gly Ser Asp Ser Gin Thr Ala Leu Asn Phe Gly Ser Thr 
15 10 15 

Leu Met Lys Lys Lys Ser Asp Pro Glu Gly Pro Ala Leu Leu Phe Pro 
20 25 30 

Glu Ser Glu Leu Ser lie Arg lie Gly Arg Ala Gly Leu Leu Ser Asp 
35 * 40 " 45 

Lys Ser Glu Asn Gly Glu Ala Tyr Gin Arg Lys Lys Ala Ala Ala Thr 
50 55 60 

Gly Leu Pro Glu Gly Pro Ala Val Pro Val Pro Ser Arg Gly Asn Leu 
65 70 75 80 

^Ala__Gln Pro_Gly— Gly_Ser -Ser_ TrpJ_ Arg. Arg_Ile_ Ala-Leu -Leu-Ile-Leu- 

85 90 95 

Ala lie Thr lie His Asn Val Pro Glu Gly Leu Ala Val Gly Val Gly 
100 105 110 

Phe Gly Ala lie Glu Lys Thr Ala Ser Ala Thr Phe Glu Ser Ala Arg 
115 120 125 
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Asn Leu Ala lie Gly lie Gly He Gin Asn Phe Pro Glu Gly Leu Ala 
130 135 140 

Val Ser Leu Pro Leu Arg Gly Ala Gly Phe Ser Thr Trp Arg Ala Phe 
145 150 155 160 

Trp Tyr Gly Gin Leu Ser Gly Met Val Glu Pro Leu Ala Gly Val Phe 
165 170 175 

Gly Ala Phe Ala Val Val Leu Ala Glu Pro He Leu Pro Tyr Ala Leu 
180 185 190 

Ala Phe Ala Ala Gly Ala Met Val Tyr Val Val Met Asp Asp He He 
195 200 205 

Pro Glu Ala Gin He Ser Gly Asn Gly Lys Leu Ala Ser Trp Ala Ser 
210 215 220 

He Leu Gly Phe Val Val Met Met Ser Leu Asp Val Gly Leu Gly 
225 230 235 



<210> 218 

<211> 43 _ 

<212> PRT ' * 

<213> Homo sapiens 

<400> 218 

Met He Pro Gly Ser Asp Ser Gin Thr Ala Leu Asn Phe Gly Ser Thr 
1 5 10 15 

Leu Met Lys Lys Lys Ser Asp Pro Glu Gly Pro Ala Leu Leu Phe Pro 
20 25 30 

Glu Ser Glu Leu Ser He Arg He Gly Arg Ala 
35 40 



<210> 219 
<211> 41 
<212> PRT 

<213> Homo sapiens 
<400> 219 

Gly Leu Leu Ser Asp Lys Ser Glu Asn Gly Glu Ala Tyr Gin Arg Lys 
15 10 15 



Lys Ala Ala Ala Thr Gly Leu Pro Glu Gly Pro Ala Val Pro Val Pro 
20 25 30 

Ser Arg Gly Asn Leu Ala Gin Pro Gly 
35 40 
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<210> 220 

<211> 44 

<212> PRT 

< 2 1 3 > Homo sapi ens 

<400> 220 

Gly Ser Ser Trp Arg Arg lie Ala Leu Leu lie Leu Ala . lie Thr lie 
15 10 15 

His Asn Val Pro Glu Gly Leu Ala Val Gly Val Gly Phe Gly Ala lie 
20 25 30 

Glu Lys Thr Ala Ser Ala Thr Phe Glu. Ser Ala Arg 
35 40 



<210> 221 

<211> 43 

<212> PRT 

<213> Homo sapiens 

<400> 221 

Asn Leu Ala lie Gly lie Gly lie Gin Asn Phe Pro Glu Gly Leu Ala 
1 5 10 15 

Val Ser Leu Pro Leu Arg- Gly -Ala -Gly Phe Ser Thr Trp Arg Ala Phe 
20 25 30 

Trp Tyr Gly Gin Leu Ser Gly Met Val Glu Pro 
35 40 



<210> 222 

<211> 43 

<212> PRT 

<213> Homo sapiens 

<400> 222 

Leu Ala Gly Val Phe Gly Ala Phe Ala Val Val Leu Ala Glu Pro He 



15 10 15 

Leu Pro Tyr Ala Leu Ala Phe Ala Ala Gly Ala Met Val Tyr Val Val 
20 25 30 

Met Asp Asp He He Pro Glu Ala Gin He Ser 
35 40 



<210> 223 

<211> 25 

<212> PRT 

<213> Homo sapiens 

<400> 223 

Gly Asn Gly Lys Leu Ala Ser Trp Ala Ser He Leu Gly Phe Val Val 
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10 15 



Met Met Ser Leu Asp Val Gly Leu Gly 
20 25 



<210> 224 
<211> 11 
<212> PRT 

<213> Homo sapiens 
<400> 224 

Thr Arg Pro He Thr Tyr Val Leu Leu Ala Gly 
1 5 10 



<210> 225 
<211> 35 
<212> PRT 

<213> Homo sapiens 
<400> 225 

Gly Thr Ser Leu Thr Ala Pro Leu Leu Glu Phe Leu Leu Ala Leu Tyr 
1 5 10 15 

Phe Leu Phe Ala Asp Ala Met -Gin Leu Asn Asp Lys Trp Gin Gly Leu 
20 25 30 

Cys Trp Pro 
35 



<210> 226 
<211> 30 
<212> PRT 

<213> Homo sapiens 
<400> 226 

Leu Ala Asn Phe Glx Cys Ser Asp Cys Ala Gin Thr Val Leu Phe Val 
1 5 10 15 

Leu Glx Phe Glx He Leu Val Phe Thr Tyr Glu He Pro Phe 
20 25 30 



<210> 227 
<211> 75 

<2.12>_PRT_ : 

<213> Homo sapiens 

<400> 227 

Gin Ala Trp His Glu Val Gly Gly Gly Val Arg Arg Cys Trp Phe Val 
1 5 10 15 

Leu Gly Glu Arg Arg Ala Gly Ser Leu Leu Ser Ala Ser Tyr Gly Thr 
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20 25 30 

Phe Ala Met Pro Gly Met Val Leu Phe Gly Arg Arg Trp Ala lie Ala 
3 5 40 45 

Ser Asp Asp Leu Val Phe Pro Gly Phe Phe Glu Leu Val Val Arg Val 
50 55 60 

Leu Trp Trp lie Gly lie Leu Thr Leu Tyr Leu 
65 70 75 



<210> 228 
<211> 125 
<212> PRT 
<213> Homo sapiens 



<400> 228 

Pro Gly Met Val Leu Phe Gly Arg Arg Trp Ala lie Ala Ser Asp Asp 
15 10 15 

Leu Val Phe Pro Gly Phe Phe Glu Leu Val Val Arg Val Leu Trp Trp 
20 25 30 

lie Gly lie Leu Thr Leu - Tyr Leu. Met His Arg Gly Lys Leu Asp Cys 

35 " : - -40- 45 * 

Ala Gly Gly Ala Leu Leu Ser Ser Tyr Leu lie Val Leu Met lie Leu 
50 55 60 

Leu Ala Val Val lie Cys Thr Val Ser Ala lie Met Cys Val Ser Met 
65 70 75 80 

Arg Gly Thr lie Cys Asn Pro Gly Pro Arg Lys Ser Met Ser Lys Leu 
85 90 95 

Leu Tyr lie Arg Leu Ala Leu Phe Phe Pro Glu Met Val Trp Ala Ser 
100 105 110 



Leu Gly Ala Ala Trp Val Ala Asp Gly Val Gin Cys Asp 
115 120 125 



<210> 229 

<211> 18 

<212> PRT 

<213> Homo sapiens 



<400> 229 

His Glu Arg Asn Cys Phe Pro Met Trp Leu Asn His Ser Ala Phe Pro 
1 5 10 15 

Pro Val 
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<210> 230 
<211> 132 
<212> PRT 

<213> Homo sapiens 
<400> 230 

Gly Trp Thr Arg Glu Asn Asp His Arg Ala Leu Ser Lys Ala Gly lie 
15 10 15 

Gly Ser Ala Glu lie Gin Pro Ser Asn Leu Arg Val Gly Ser Ala Lys 
20 25 30 

Asp Leu Gly Lys Pro Trp Ala Gly Lys Leu Leu Leu Leu Ser Ser Cys 
35 40 45 

Leu Leu Phe Phe Ser Leu Gly Val Leu Tyr Arg Gly Gin Met Leu Ala 
50 55 60 

Pro Pro Leu Gin Glu Asp Trp Lys Gly Gly Val Lys Asp Ser Asp Leu 
65 70 75 80 

lie Asp Asp Ser Ser Ala Ser Pro lie Pro Pro Ser Tyr Leu Glu Tyr 
85 90 _ 95 

Lys Ala Ala Leu Tyr Pro Phe * Ser Glu His Lys Ser Val Arg Asn Ala 
100 105 110 

Thr Asp Ser Leu Thr Phe Phe Leu Val Thr Asp His Phe Leu Asp Asn 
115 120 125 

Gin Asp Ser Gin 
130 



<210> 231 
<211> 45 
<212> PRT 

<213> Homo - sapiens - - , . 

<400> 231 

Gly Trp Thr Arg Glu Asn Asp His Arg Ala Leu Ser Lys Ala Gly He 
15 10 15 

Gly Ser Ala Glu He Gin Pro Ser Asn Leu Arg Val Gly Ser Ala Lys 
20 25 30 

~Asp"Leu~Gly " Lys~Pr o~Trp~ Ala -Gly— Lys~ Leu -Leu-Leu— Leu 

35 40 45 



<210> 232 
<211> 46 
<212> PRT 

<213> Homo sapiens 
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<400> 232 

Ser Ser Cys Leu Leu Phe Phe Ser Leu Gly Val Leu Tyr Arg Gly Gin 
15 10 15 

Met Leu Ala Pro Pro Leu Gin Glu Asp Trp Lys Gly Gly Val Lys Asp 
20 25 30 

Ser Asp Leu lie Asp Asp Ser Ser Ala Ser Pro lie Pro Pro 
35 40 45 



<210> 233 

<211> 41 

<212> PRT 

<213> Homo sapiens 

<400> 233 

Ser Tyr Leu Glu Tyr Lys Ala Ala Leu Tyr Pro Phe Ser Glu His Lys 
15 10 15 

Ser Val Arg Asn Ala Thr Asp Ser Leu Thr Phe Phe Leu Val Thr Asp 
20 25 30 

His Phe Leu Asp Asn- Gin Asp Ser Gin 
35 "'40 



<210> 234 

<211> 11 

<212> PRT 

<213> Homo sapiens 

<400> 234 

Leu Lys Phe His Gin Glu Ser Leu Ser Gly Asp 
1 5 10 



<210> -235 - „ .". 

<211> 25 

<212> PRT 

<213> Homo sapiens 

<400> 235 

Glu Ala Lys Ser Arg Pro Val Thr Gin Ala Gly Val Gin Trp His Asp 
1 5 10 .15 

Leu _ Gly~Ser -Leu-Gin— Pro— Leu— Pro— Pro 

20 25 



<210> 236 

<211> 25 

<212> PRT 

<213> Homo sapiens 
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<400> 236 

Glu Ala Lys Ser Arg Pro Val Thr Gin Ala Gly Val Gin Trp His Asp 
15 10 15 

Leu Gly Ser Leu Gin Pro Leu Pro Pro 
20 25 



<210> 237 
<211> 137 
<212> PRT 

<213> Homo sapiens 
<400> 237 

Ala Leu Val Leu Val Cys Arg Gin Arg Tyr Cys Arg Pro Arg Asp Leu 
15 10 15 

Leu Gin Arg Tyr Asp Ser Lys Pro lie Val Asp Leu lie Gly Ala Met 
20 25 30 

Glu Thr Gin Ser Glu Pro Ser Glu Leu Glu Leu Asp Asp Val Val lie 
35 40 45 

Thr Asn Pro His lie Glu Ala lie Leu Glu Asn Glu Asp Trp lie Glu 

50 55 60 ~ - * 

Asp Ala Ser Gly Leu Met Ser His Cys lie Ala lie Leu Lys lie -Cys 
65 70 75 80 

His Thr Leu Thr Glu Lys Leu Val Ala Met Thr Met Gly Ser Gly Ala 
85 90 95 

Lys Met Lys Thr Ser Ala Ser Val Ser Asp lie He Val Val Ala Lys 
100 105 110 

Arg lie Ser Pro Arg Val Asp Asp Val Val Lys Ser Met Tyr Pro Pro 
115 120 125 



Leu Asp Pro Lys Leu Leu Asp Ala Arg 
130 135 



<210> 238 
<211> 319 
<212> PRT 

<213> Homo sapiens 



<400> 238 

Asp Val Glu Ser Arg Gly Pro Ser Ala Arg Cys Leu Pro Val Val Pro 
1 5 10 15 

Gly Ser Leu Leu Pro Gly Leu Glu Pro Ala Thr Lys Leu Met Pro Gly 
20 25 30 
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Gly Leu Ala Pro Gly His Gly Ala Pro Val Arg Glu Leu Leu Leu Pro 
35 40 45 

Leu Leu Ser Gin Pro Thr Leu Gly Ser Leu Trp Asp Ser Leu Arg His 
50 55 . 60 

Cys Ser Leu Leu Cys Asn Pro Leu Ser Cys Val Pro Ala Leu Glu Ala 
65 70 75 80 

Pro Pro Ser Leu Val Ser Leu Gly Cys Ser Gly Gly Cys Pro Arg Leu 
85 90 95 

Ser Leu Ala Gly Ser Ala Ser Pro Phe Pro Phe Leu Thr Ala Leu Leu 
100 105 110 

Ser Leu Leu Asn Thr Leu Ala Gin He His Lys Gly Leu Cys Gly Gin 
115 120 125 

Leu Ala Ala He Leu Ala Ala Pro Gly Leu Gin Asn Tyr Phe Leu Gin 
130 135 140 

Cys Val Ala Pro Gly Ala Ala Pro His Leu Thr Pro Phe Ser Ala Trp 
145 150 155 160 

Ala Leu Arg His Glix. Tyr His Leu Gin Tyr Leu Ala Leu. Ala Leu Ala 
165 - - 170 175 

Gin Lys Ala Ala Ala Leu Gin Pro- Leu Pro Ala Thr His Ala Ala Leu 
180 185 190 

Tyr His Gly Met Ala Leu Ala Leu Leu Ser Arg Leu Leu Pro Gly Ser 
195 200 205 

Glu Tyr Leu Thr His Glu Leu Leu Leu Ser Cys Val Phe Arg Leu Glu 
210 215 220 

Phe Leu Pro Glu Arg Thr Ser Gly Gly Pro Glu Ala Ala Asp Phe Ser 
225 230 235 240 

Asp Gin Leu Ser Leu Gly Ser Ser Arg Val Pro Arg Cys Gly Gin Gly 
245 250 255 

Thr Leu Leu Ala Gin Ala Cys Gin Asp Leu Pro Ser He Arg Asn Cys 
260 265 270 

Tyr Leu Thr His Cys Ser Pro Ala Arg Ala Ser Leu Leu Ala Ser Gin 
275 280 285 



Ala Leu His Arg Gly Glu Leu Gin Arg Val Pro Thr Leu Leu Leu Pro 
290 295 300 



Met Pro Thr Glu Pro Leu Leu Pro Thr Asp Trp Pro Phe Leu His 
305 310 315 
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<210> 239 

<211> 44 

<212> PRT 

<213> Homo sapiens 

<400> 239 

Asp Val Glu Ser Arg Gly Pro Ser Ala Arg Cys Leu Pro Val Val Pro 
15 10 15 

Gly. Ser Leu Leu Pro Gly Leu Glu Pro Ala Thr Lys Leu Met Pro Gly 
20 25 30 

Gly Leu Ala Pro Gly His Gly Ala Pro Val Arg Glu 
35 40 



<210> 240 

<211> 45 

<212> PRT 

<213> Homo sapiens 

<400> 240 

Leu Leu Leu Pro Leu Leu Ser Gin Pro Thr Leu Gly Ser Leu Trp Asp 
1 t 5 10 15 

Ser Leu Arg His Cys Ser Leu Leu Cys Asn Pro Leu Ser Cys Val Pro 
20 25 30 

Ala Leu Glu Ala Pro Pro Ser Leu Val Ser Leu Gly Cys 
35 40 45 



<210> 241 
<211> 45 
<212> PRT 

<213> Homo sapiens 
<400> 241 

Ser GlY_Gly Cys Pro Arg„Leu Ser. Leu Ala.Gly Ser Ala Ser__Pro Phe 
1 5 10 15 

Pro Phe Leu Thr Ala Leu Leu Ser Leu Leu Asn Thr Leu Ala Gin lie 
20 25 30 

His Lys Gly Leu Cys Gly Gin Leu Ala Ala lie Leu Ala 
35 40 45 



<210> 242 
<211> 44 
<212> PRT 

<213> Homo sapiens 
<400> 242 

Ala Pro Gly Leu Gin Asn Tyr Phe Leu Gin Cys Val Ala Pro Gly Ala 
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10 



15 



Ala Pro His Leu Thr Pro Phe Ser Ala Trp Ala Leu Arg His Glu Tyr 
20 25 30 



His Leu Gin Tyr Leu Ala Leu Ala Leu Ala Gin Lys 
35 40 



<210> 243 
<211> 44 
<212> PRT 

<213> Homo sapiens 
<400> 243 

Ala Ala Ala Leu Gin Pro Leu Pro Ala Thr His Ala Ala Leu Tyr His 
1. 5 10 15 

Gly Met Ala Leu Ala Leu Leu Ser Arg Leu Leu Pro Gly Ser Glu Tyr 
20 25 30 

Leu Thr His Glu Leu Leu Leu Ser Cys Val Phe Arg 
35 '40 



<210> 244 
<211> 44 
<212> PRT 

<213> Homo sapiens 
<400> 244 

Leu Glu Phe Leu Pro Glu Arg Thr Ser Gly Gly Pro Glu Ala Ala Asp 
1 5 10 15 

Phe Ser Asp Gin Leu Ser Leu Gly Ser Ser Arg Val Pro Arg Cys Gly 
20 25 30 

Gin Gly Thr Leu Leu Ala Gin Ala Cys Gin Asp Leu 
35 40 



<210> 245 
<211> 53 
<212> PRT 

<213> Homo sapiens 
<400> 245 

-Pro- Ser-Ile- Arg-Asn -Cys-Tyr_Xeu"-Thr~His-Cys-Ser— Pro-Ala-Arg-Ala 
15 10 15 



Ser Leu Leu Ala Ser Gin Ala Leu His Arg Gly Glu Leu Gin Arg Val 
20 25 30 

Pro Thr Leu Leu Leu Pro Met Pro Thr Glu Pro Leu Leu Pro Thr Asp 
35 40 45 
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Trp Pro Phe Leu His 
50 



<210> 246 
<211> 25 
<212> PRT 

<213> Homo sapiens 



<400> 246 

Val Gly Ser Val Leu Gly Ala Phe Leu Thr Phe Pro Gly Leu Arg Leu 
1 5 10 15 

Ala Gin Thr His Arg Asp Ala Leu Thr 
20 * 25 



<210> 247 

<211> 65 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE _ 

<222> (21) " ..." - 

<223> Xaa equals any of the naturally occurring L-amino acids 

<220> 

<221> SITE 
<222> (37) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (48) 

<223> Xaa equals any of the naturally occurring L-amino acids 

<220> 

<221> SITE 
<222> (57) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 247 

Leu Glu Cys Thr Asp Thr lie Met Val His Cys Ser Leu Lys Leu Leu 
1 5 10 15 

Ser- Pro-Ser- Asp-Xaa-Ser-His-Ser— Ala-Ser-Gln-Val--A-la-Lys-Thr-Arg 
20 25 30 

Gly Val His His Xaa Thr Gin Leu lie Phe Lys Val Phe Phe Val Xaa 
35 40 45 

Met Gly Ser His Ser Thr Lys Tyr Xaa Ser He Arg Pro Gly Leu Leu 
50 55 60 
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Pro 
65 



<210> 248 
<211> 14 
<212> PRT 

<213> Homo sapiens 
<400> 248 

Glu Ser Ser Phe Val Pro Pro Ala Ala His Ser Ser Leu Cys 
15 10 



<210> 249 
<211> 172 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (72) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 249 - " 

Leu Leu Pro Gly Gin Gin Glu Ala Thr Gin Cys Val Glu Ala Gly Ala 
1 5 10 15 

Gly Glu Gly Ala Leu Thr Pro Met Cys Pro Cys Arg Gin Glu Gin Phe 
20 25 30 

Val Asp Leu Tyr Lys Glu Phe Glu Pro Ser Leu Val Asn Ser Thr Val 
35 40 45 

Tyr lie Met Ala Met Ala lie Gin Met Ala Pro Phe Ala lie Asn Tyr 
50 55 60 

Lys Val Arg Pro Gly Pro Cys Xaa Asn lie His Cys Leu Pro Thr Gin 
65 70 75 ~ 80 

Pro His Pro Met Lys Pro Ser Val Pro His Pro His Arg Ala Arg Pro 
85 90 95 

Ser Trp Arg Ala Cys Pro Arg Thr Ser Pro Trp Cys Gly Val Trp Gin 
100 105 110 

_Phe__His_Ser. _Trp_Pro_Ser_Leu_Ala^Cys -Ser Ser-Ala -Pro- -Arg— P-ro- Thr— 
115 120 125 

Ser Thr Ala Ser Leu Ala Ser Trp Thr Ser Leu Trp Ser Ser Ser Trp 
130 135 140 

Ser Leu Pro Arg Ser Cys Ser Trp Thr Ser Ala Trp Arg Ser Trp Pro 
145 150 155 160 
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Thr Ala Ser Cys Ser Ser Ser Trp Gly Pro Arg Ser 
165 170 



<210> 250 

<211> 45 

<212> PRT 

<213> Homo sapiens 

<400> 250 

Leu Leu Pro Gly Gin Gin Glu Ala Thr Gin Cys Val Glu Ala Gly Ala 
1 5 10 15 

Gly Glu Gly Ala Leu Thr Pro Met Cys Pro Cys Arg Gin Glu Gin Phe 
20 25 30 

Val Asp Leu Tyr Lys Glu Phe Glu Pro Ser Leu Val Asn 
35 40 45 



<210> 251 
<211> 44 
<212> PRT 

<213> Homo sapiens _ . _ 

<220> 

<221> SITE 
<222> (27) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 251 

Ser Thr Val Tyr lie Met Ala Met Ala lie Gin Met Ala Pro Phe Ala 
15 10 15 

lie Asn Tyr Lys Val Arg Pro Gly Pro Cys Xaa Asn lie His Cys Leu 
20 25 30 

Pro Thr Gin Pro His Pro Met Lys Pro Ser Val Pro 

35 40 " 



<210> 252 

<211> 42 

<212> PRT 

<213> Homo sapiens 

<400>-252 



His Pro His Arg Ala Arg Pro Ser Trp Arg Ala Cys Pro Arg Thr Ser 
15 10 15 

Pro Trp Cys Gly Val Trp Gin Phe His Ser Trp Pro Ser Leu Ala Cys 
20 25 30 

Ser Ser Ala Pro Arg Pro Thr Ser Thr Ala 
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35 40 



<210> 253 
<211> 41 
<212> PRT 

<213> Homo sapiens 
<400> 253 

Ser Leu Ala Ser Trp Thr Ser Leu Trp Ser Ser Ser Trp Ser Leu Pro 
1 5 10 15 

Arg Ser Cys Ser Trp Thr Ser Ala Trp Arg Ser Trp Pro Thr Ala Ser 
20 25 30 

Cys Ser Ser Ser Trp Gly Pro Arg Ser 
35 40 



<210> 254 
<211>*48 
<212> PRT 

<213> Homo sapiens 

<400> 254 . . 

Thr Arg Asn lie Leu Ser Phe- lie Lys Cys Val lie His Asn Phe Trp 
1 5 10 15 

lie Pro Lys Glu Ser Asn Glu lie Thr lie lie lie Asn Pro Tyr Arg 
20 25 30 

Glu Thr Val Cys Phe Ser Val Glu Pro Val Lys Lys lie Phe Asn Tyr 
35 40 45 



<210> 255 .__ ■ _ 

<211> 27 V ~ " _ 

<212> PRT 

<213> Homo sapiens 
<400> 255 

Leu Val Val Leu Phe Ala Ser Ser Asn Ser Arg Tyr Leu Lys Tyr Phe 
15 10 15 

-Phe_Leu-Val Pro_Leu-Ile- Leu_Gly'^Ser-Ala _Trp 

20 25 



<210> 256 

<211> 20 

<212> PRT 

<213> Homo sapiens 
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<400> 256 

His Glu Trp Lys Cys Lys Gin Lys Tyr Ser Glu Gly Ser Gly Asn Thr 
15 10 15 

Arg lie Gly Asn 
20 



<210> 257 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 257 

Leu Leu Pro Leu Cys Phe Leu Gly Pro Arg Gin Val Leu Glu Glu Phe 
1 .5 10 15 

Pro Ser He Val 
20 



<210> 258 
<211> 12 

<212> PRT _ . 

<213> Homo sapiens 

<400> 258 

Pro Thr Arg Pro Ser Lys His Gin Glu Ala Gly Ser 
1 5 10 



<210> 259 

<211> 42 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 

<222> (39) . 

<22 3> Xaa equals any of the naturally occurring L-amino acids 

<400> 259 

Gly Gin Gly Pro Ala Gly Arg Trp Val Arg Arg Leu Pro Cys Ser Arg 
1 5 10 15 

Arg Ala Gly Gly Glu Arg Gly Pro His Trp Gly Val Trp Ala Gly Pro 

20 ' 25 30 , 

Gin Met Ser Cys Gly Leu Xaa Phe Gly Pro 
35 40 



<210> 260 
<211> 193 
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<212> PRT 

<213> Homo sapiens 

<400> 260 

Trp Arg Thr Gin Gly Pro Met Val Leu Leu Trp Val Val Thr Cys Pro 
1 5 .10 15 

Ala Thr Met Leu Thr Glu Pro Gin Asn Pro His Leu lie Gly Phe Val 
20 25 . 30 

Ala Tyr Ser Gly Pro Ser His Thr Thr Gin Pro His Lys Tyr Trp Leu 
35 40 45 

Leu Leu Asp Gly- Gin Ala Asp Pro Ala Ala Ala Glu Gly Pro Val Lys 
50 55 60 

Arg Lys Ala Ala Ser Val Val Trp Trp Pro Gin Ala Leu Arg His Leu 
65 70 75 80 

Ser Leu Leu Val His Cys Trp Glu Glu Ser Tyr Glu Met Asn lie Gly 
85 90 95 

Cys Gin Ser Leu Trp Ala Gly Gly Leu Ala Ser Ser Gly Asn Gly Trp 
100 105 110 

Asp Leu Gly Val Ala Phe Arg- Arg Asp Thr Cys Met Ser Ser Ser Ser 
115 120 125 

Leu His Trp Lys Glu Phe Lys Tyr Ala Pro Gly Ser Leu His Tyr Phe 
130 135 140 

Ala Leu Ser Phe Val Leu lie Leu Thr Glu lie Cys Leu Val Ser Ser 
145 150 155 160 

Gly Met Gly Phe Pro Gin Glu Gly Lys His Phe Ser Val Leu Gly Ser 
165 170 175 

Pro Asp Cys Ser Leu Trp Gly Arg Asp Glu His Val Pro Arg Glu Phe 

180 __ 185 __ _ 190 

Ala 



<210> 261 
<211> 42 
<212> PRT 

<2 13 > -Homo -sapiens : _ 

<400> 261 

Trp Arg Thr Gin Gly Pro Met Val Leu Leu Trp Val Val Thr Cys Pro 
1 5 10 15 

Ala Thr Met Leu Thr Glu Pro Gin Asn Pro His Leu He Gly Phe Val 
20 25 30 
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Ala Tyr Ser Gly Pro Ser His Thr Thr Gin 
35 40 



<210> 262 
<211> 41 
<212> PRT 

<213> Homo sapiens 
<400> 262 

Pro His Lys Tyr Trp Leu Leu Leu Asp Gly Gin Ala Asp Pro Ala Ala 
15 10 15 

Ala Glu Gly Pro Val Lys Arg Lys Ala Ala Ser Val Val Trp Trp Pro 
20 25 30 

Gin Ala Leu Arg His Leu Ser Leu Leu 
35 40 



<210> 263 
<211> 41 
<212> PRT 

<213> Homo sapiens ■" . 

<400> 263 

Val His Cys Trp Glu Glu Ser Tyr Glu Met Asn He Gly Cys Gin Ser 
1 5 10 15 

Leu Trp Ala Gly Gly Leu Ala Ser Ser Gly Asn Gly Trp Asp Leu Gly 
20 25 30 

Val Ala Phe Arg Arg Asp Thr Cys Met 
35 40 



<210> 264 

<211> 44 __. . . 

<212> PRT 

<213> Homo sapiens 
<400> 264 

Ser Ser Ser Ser Leu His Trp Lys Glu Phe Lys Tyr Ala Pro Gly Ser 
15 10 15 

Leu His Tyr Phe Ala Leu Ser Phe Val Leu He Leu Thr Glu He Cys 

20 25 30 > - 



Leu Val Ser Ser Gly Met Gly Phe Pro Gin Glu Gly 
35 40 



<210> 265 
<211> 25 
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<212> PRT 

<213> Homo sapiens 

<400> 265 

Lys His Phe Ser Val Leu Gly Ser Pro Asp Cys Ser Leu Trp Gly Arg 
1 5 .10 15 

Asp Glu His Val Pro Arg Glu Phe Ala 
20 25 



<210> 266 

<211> 31 

<212> PRT 

<213> Homo sapiens 

<400> 266 

lie Ala Gin Gly Thr Val Pro Leu Thr Lys Arg Gly Val Gin Ser Ser 
1 5 10 15 

Gly. Pro Asp Tyr Pro Glu Gly Thr Leu Thr Pro Leu Pro Arg Gly 
20 25 30 

<210~> 2 67 . - 

<211> 31 """ - - " 

<212> PRT 

<213> Homo sapiens 

<400> 267 

lie Ala Gin Gly Thr Val Pro Leu Thr Lys Arg Gly Val Gin Ser Ser 
1 5 10 15 

Gly Pro Asp Tyr Pro Glu Gly Thr Leu Thr Pro Leu Pro Arg Gly 
20 - 25 30 



<210> 268 

<211> _28 . 

<212> PRT 

<213> Homo sapiens 

<400> 268 

Asp Cys Leu Tyr Leu Ala Leu Ser Phe Pro Trp His Cys His Cys His 
15 10 15 

His His Pro Pro Ser Gly Ser Leu Leu Tyr Pro Phe 

20— 1—25 



<210> 269 
<211> 101 
<212> PRT 
<213> Homo sapiens 
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<400> 269 

Ala Ser Leu Pro Pro Ser Arg Ser Arg Pro Leu Ala Asn Met Ala Leu 
15 10 15 

Val Pro Cys Gin Val Leu Arg Met Ala lie Leu Leu Ser Tyr Cys Ser 
20 25 30 

lie Leu Cys Asn Tyr Lys Ala lie Glu Met Pro Ser His Gin Thr Tyr 
35 40 45 

Gly Gly Ser Trp Lys Phe Leu Thr Phe lie Asp Leu Val lie Gin Ala 
50 55 .60 

Val Phe Phe Gly lie Cys Val Leu Thr Asp Leu Ser Ser Leu Leu Thr 
65 .70 75 80 

Arg Gly Ser Gly Asn Gin Glu Gin Glu Arg Gin Leu Lys Lys Leu lie 
85 90 95 

Ser Leu Arg Asp Trp 
100 

<210> 270 

<211> 16 _--'"--*.. "" 

<212> PRT " " 

<213> Homo sapiens 

<400> 270 

Met Ser Arg Ser Ser Arg lie Ser Gly Leu Ser Cys Pro Trp Leu Leu 
15 10 15 



<210> 271 
<211> 45 
<212> J>RT 

<213> Homo sapiens 
<400> 271 

Asp His Trp Pro Ala Gly Phe Leu Pro Pro Ala Pro Gly Leu Lys Phe 
15 10 15 

Pro Val Ala Leu Glu Val Phe Arg Lys Val Leu Pro Ala Val Cys Pro 
20 25 30 



Thr Asp Cys Ser Gly Ser Ala Gly Lys Glu Arg Asn Ser 
35 40 45 



<210> 272 
<211> 47 
<212> PRT 
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<213> Homo sapiens 
<400> 272 

Glu Glu lie Ala Thr Ser lie Glu Pro lie Arg Asp Phe Leu Ala He 
15 10 15 

Val Phe Phe Ala Ser He Gly Leu His Val Phe Pro Thr Phe Val Ala 
20 25 30 

Tyr Glu Leu Thr Val Leu Val Phe Leu Thr Leu Ser Val Val Val 
35 40 45 



<210> 273 
<211> 7 
<212> PRT 

<213> Homo sapiens 
<400> 273. 

Tyr Cys Asn Leu Gin Cys Arg 
1 5 



<210> 274 

<211> 44 _ . - - ..." _.. - " . . . 

<212> PRT - ■ - " 

<213> Homo sapiens 

<400> 274 

Ser Ala Leu He Gly Asn Pro Lys Gly Cys Phe Gly Cys Phe Ser Pro 
1 .5 10 15 

Val Val Leu Arg Glu Trp Ser Val Glu Ser Trp Lys Ser Leu Arg Pro 
20 25 30 

Phe Gin Ala He Cys Lys Leu Lys Thr Asn Phe Arg 
35 40 



<210> 275 

<211> 8 

<212> PRT 

<213> Homo sapiens 

<400> 275 

His Glu Ala Ala Leu Arg Gly Pro 
1 5 



<210> 276 
<211> 26 
<212> PRT 

<213> Homo sapiens 



<400> 276 
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Ser Asn Ala Ala Gly Asn Val Val Arg Ala Phe Leu Tyr lie Asn His 
1 5 10 15 

Leu Lys Leu Gly Cys Lys Val Gly Leu Ala 
20 25 



<210> 277 

<211> 25 

<212> PRT 

<213> Homo sapiens 

<400> 277 

Asn Trp Ala Val Leu Asn Met' Leu Leu Ser Lys Gly Lys lie Thr lie 
1 .5 10 15 

Phe Leu Gly Pro Leu Glu Cys Gly Ser 
20 25 



<210> 278 
<211> 49 
<212> PRT 

<213> Homo sapiens 

<400> 278 "~ ' - " " " 

Pro Ser His Gin Thr Arg Lys Gly Lys Ser Ala Lys Leu Leu Asp Arg 
1 5 10-15 

Pro Pro Glu Ala Leu Arg Met Lys lie lie Thr Thr Thr Leu Leu Leu 
20 25 30 

Ala Cys His Leu Gin Leu Glu Val Gly Val Val Val Gly Gly Glu Val 
35 40 45 

Asp 



<210> 279 
<211> 51 
<212> PRT 

<213> Homo sapiens 
<400> 279 

Phe Gin Ala Ser Ser Ala Asn Asn Gin Gin Asn Trp Gly Ser Gin Pro 
1 5 10 15 



lie Ala Gin Gin Pro Leu Gin Gin Gly Gly Asp Tyr Ser Gly Asn Tyr 
20 25 30 

Gly Tyr Asn Asn Asp Asn Gin Glu Phe Tyr Gin Asp Thr Tyr Gly Gin 
35 40 45 

Gin Trp Lys 
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50 



<210> 280 
<211> 264 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (2) 

<22 3> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (6) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (14) 

<22 3> Xaa equals any of the naturally occurring L-amino acids 
<400> 280 

Trp Xaa Pro Leu Leu Xaa . Thr Ser. Gly Ser Pro Gly Leu Xaa Gly Phe 

1 5" ■ : - " 10 * 15 

Gly Thr Arg Met Asn Gly Lys Glu He Glu Gly Glu Glu He Glu He 
20 25 30 

Val Leu Ala Lys Pro Pro Asp Lys Lys Arg Lys Glu Arg Gin Ala Ala 
35 40 .45 

Arg Gin Ala Ser Arg Ser Thr Ala Tyr Glu Asp Tyr Tyr Tyr His Pro 
50 55 60 

Pro Pro Arg Met Pro Pro Pro He Arg Gly Arg Gly Arg Gly Gly Gly 
65 70 75 80 

Arg Gly Gly Tyr Gly Tyr Pro Pro Asp Tyr Tyr Gly Tyr Glu Asp Tyr 
85 90 95 

Tyr Asp Asp Tyr Tyr Gly Tyr Asp Tyr His Asp Tyr Arg Gly Gly Tyr 
100 105 110 

Glu Asp Pro Tyr Tyr Gly Tyr Asp Asp Gly Tyr Ala Val Arg Gly Arg 
115 120 125 



Gly Gly Gly Arg Gly Gly Arg Gly Ala Pro Pro Pro Pro Arg Gly Arg 
130 135 140 

Gly Ala Pro Pro Pro Arg Gly Arg Ala Gly Tyr Ser Gin Arg Gly Ala 
145 150 ' 155 160 

Pro Leu Gly Pro Pro Arg Gly Ser Arg Gly Gly Arg Gly Gly Pro Ala 
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165 

Gin Gin Gin Arg Gly 
180 

Gly Asn Val Gly Gly 
195 

Ser Lys Arg Arg Gin 
210 

Ser Leu Ser Ser Arg 
225 

Val Thr lie Met Thr 
245 

Ser Gly Ser Arg Gin 
260 



170 

Arg Gly Ser Arg Gly Ser 
185 

Lys Arg Lys Ala Asp Gly 
. 200 

Pro Thr Thr Asn Arg Thr 
215 

Phe Ser Lys Val Val Thr 
230 235 

Thr Arg Asn Phe lie Arg 
250 

Val Arg Ala 



175 

Arg Gly Asn Arg Gly 
190 

Tyr Asn Gin Pro Asp 
205 

Gly Val Pro Asn Pro 
220 

lie Leu Val Thr Met 
240 

lie Leu Met Gly Asn 
255 



<210> 281 
<211> 27 
<212> PRT 

<213> Homo sapiens _ 
<400> 281 

Arg Met Asn Gly Lys Glu lie Glu Gly Glu Glu lie Glu lie Val Leu 
1 5 10 15 

Ala Lys Pro Pro Asp Lys Lys Arg Lys Glu Arg 
20 25 

<210> 282 
<211> 25 
<212> PRT 

<213> Homo sapiens 



<400> 282 

Tyr Tyr His Pro Pro Pro Arg Met Pro Pro Pro lie Arg Gly Arg Gly 
1 5 10 15 

Arg Gly Gly Gly Arg Gly Gly Tyr Gly 
20 25 

<210> 283 : 

<211> 26 
<212> PRT 

<213> Homo sapiens 
<400> 283 

Asp Tyr Arg Gly Gly Tyr Glu Asp Pro Tyr Tyr Gly Tyr Asp Asp Gly 
15 10 15 
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Tyr Ala Val Arg Gly Arg Gly Gly Gly Arg 
20 25 



<210> 284 
<211> 28 
<212> PRT 

<213> Homo sapiens 
<400> 284 

Pro Pro Pro Arg Gly Arg Ala Gly Tyr Ser Gin Arg Gly Ala Pro Leu 
15 10 15 

Gly Pro Pro Arg Gly Ser Arg Gly Gly Arg Gly Gly 
20 25 



<210> 285 

<211> 35 

<212> PRT 

<213> Homo sapiens 

<400> 285 

Ala Asp Gly Tyr Asn J31n Pro Asp Ser Lys Arg Arg Gin. Pro Thr Thr 
1 5"~ - - • 10 15 

Asn Arg Thr Gly Val Pro Asn Pro Ser Leu Ser Ser Arg Phe Ser Lys 
20 25 30 

Val Val Thr 
35 



<210> 286 
<211> 19 
<212> PRT 

<213> Homo sapiens 

<400> 286 " 

Leu Gin lie Pro Pro Ser Ser Gin Ser Leu Gly Leu Lys Asn Ala Asp 
1 5 10 15 

Ser Ser lie 



- <210> 287 - ' 

. <211> 129 
<212> PRT 

<213> Homo sapiens 
<400> 287 

Gly Gly Pro Pro Glu Ser Ala Pro Trp Leu Pro Ala Val Leu Arg Ala 
15 10 15 
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Pro Val Leu Thr Ser Arg Cys Ala Ser Ser Asp Ser Glu Gly Pro Val 
20 25 30 

Trp Phe Cys Gin Pro Gly Ser Gly Pro Ser Ser Thr Glu Met Ser Cys 
35 40 45 

His Cys lie Leu Gly Pro Gly Ser Ser Cys Leu Cys Val Leu Arg Gly 
50 55 60 

Ser Met Trp Thr Pro Ser Val Pro Gly Trp Pro Gin Pro Ala Lys Glu 
65 70 75 80 

Thr Gly Ala Ser Ser Cys Ser Val Phe Ser Ala Asn Asn Gly Ser Cys 
,85. 90 95 

Pro Leu Pro Leu His Asn His Gin Arg Gin Ala Ser Leu Asp Thr Gly 
100 105 110 

Leu Ser Leu Glu His Val Pro Gly Glu Ser Tyr Phe Tyr Ser Pro Val 
115 120 125 

Gly 



<210> 288 

<211> 34 

<212> PRT 

<213> Homo sapiens 

<400> 288 

Ser Ser Asp Ser Glu Gly Pro Val Trp Phe Cys Gin Pro Gly Ser Gly 
. 1 5 10 15 

Pro Ser Ser Thr Glu Met Ser Cys His Cys lie Leu Gly Pro Gly Ser 
20 25 30 

Ser Cys 



<210> 289 

<211> 28 

<212> PRT 

<213> Homo sapiens 

< 4 0 0 >- 289 l 

Trp Thr Pro Ser Val Pro Gly Trp Pro Gin Pro Ala Lys Glu Thr Gly 
15 10 15 

Ala Ser Ser Cys Ser Val Phe Ser Ala Asn Asn Gly 
20 25 



WO 99/47540 



PCT/US99/05804 



153 



<210> .290 

<211> 21 

<212> PRT 

<213> Homo sapiens 

<400> 290 

Gin Arg Gin Ala Ser Leu Asp Thr Gly Leu Ser Leu Glu His Val Pro 
15 10 15 

Gly Glu Ser Tyr Phe 

20 . 



<210> 291 

<211> 29 

<212> PRT 

<213> Homo sapiens 

<400> 291 

Ser Ser Ser Leu Val Leu Thr lie Arg Ser Gin Thr Leu Phe Leu Ala 
15 10 15 

Ser Phe lie His Ser Thr Ser lie Phe Cys Ala Leu Asn 
20 25 



<210> 292 
<211> 12 
<212> PRT 

<213> Homo sapiens 
<400> 292 

Cys Cys Cys Arg Leu Gly Leu Ser Gly Pro Lys Cys 
15 10 



<210> 293 

<211> 22 

^212> PRT 

<213> Homo sapiens 

<400> 293 

Arg Ala Phe Trp Gly Leu Gly Ala Leu Gin Leu Leu Asp Leu Ser Ala 
15 10 15 

Asn Gin Leu Glu Ala Leu 
20 



<210> 294 
<211> 34 
<212> PRT 

<213> Homo sapiens 
<400> 294 
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His Ala Ser Gly Arg Arg Thr Gly Ser Ala Asp Asp Gly Leu Gin Gly 
1 5 10 15 

Arg Thr Gly Ser Gly Pro Pro Thr Ala Gly Ala Gly Gly Gly Gly Ala 
20 25 30 

Ala Pro 



<210> 295 

<211> 205 

<212> PRT 

<213> Homo sapiens 

<400> 295 

Val Ser Ala Ala Ala Gly Ala Arg Leu Ala Pro Arg Ala Pro Gly Ala 
15 10 15 

Pro Ala Gly Cys Arg Pro Met Arg Gly Cys Ala Ala Arg Ala Ala Ala 
20 25 30 

Arg Lys Ser Leu Val Pro Val Leu Pro Ala Gly Trp Arg Ser Gly Pro 
35 40 .45 

Ala Ala Ala Ala Arg Pro- Gly -Pro Arg Arg Leu Ala His Ala Pro Ser 
50 55 60 

Ala Ala Arg Ser Arg Ala Gly Pro Gly. Ala Val Ala Arg Pro Leu Pro 
65 70 75 80 

Arg Arg His Leu Ala Ala Ala His Gly Arg Gly Cys Gly Pro Ala Ala 
85 90 95 

Ala Arg Ala Gly Ala Gly Ser Gly Pro Gly Ala Arg Arg Ala Ala Arg 
100 105 110 

Val Pro Thr Ala Gly Arg Pro Pro Gly Thr His Val His Thr Ser Gly 

_ „ _ ,115 _ _ ■_ 120 125 

Gin Ser Gly Ala Pro Arg Asp Pro Glu Gly Glu Ala Leu Ala Asp Thr 
130 135 .140 

Trp Ala Gin Thr Gly Gin Gly Asp Ser Ser Ser Asn Ser Ser Ser Ser 
145 150 155 160 

Gly Arg Gly Arg Asp Gin Glu Gly Pro Arg Met Gly Ala Ala Pro Pro 

165 : 170 175 



Pro Pro Ala Pro Ala Val Gly Gly Pro Leu Pro Val Arg Pro Trp Ser 
180 185 190 

Pro Ser Ser Ala Glu Pro Val Leu Arg Pro Asp Ala Trp 
195 200 205 
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<210> 296 
<211> 368 
<212> PRT 
<213> Homo sapiens 

<400> 296 

Thr Arg Pro Ala Ala Glu Arg Ala Pro Arg Thr Thr Gly Ser Arg Asp 
1 " 5 10 15 

Ala Gin Ala Ala Gly Leu Pro Pro Arg Val Pro Gly Ala Gly Gly Leu 
20 25 30 

Pro Pro Cys Gly Ala Leu Pro Gly Arg Gly Leu Gly Arg Cys Cys Cys 
35 40 45 

Cys Cys Cys Cys Cys Arg Leu Gly Leu Ser Gly Pro Lys Cys Arg Pro 
50 55 60 

Gly Pro Arg Pro Arg Gly Pro Trp Ala Pro Arg Thr Ala Pro Arg Cys 
65 70 75 80 

Ala Arg Ala Cys Arg Glu Ala Cys Gin Leu Ser Ala Leu Ser Leu Pro 
85 90 95 

Ala Val Pro Pro Gly Leu : Ser Leu Arg Leu Arg Ala Leu Leu Leu Asp 
100 105 110 

His Asn Arg Val Arg Ala Leu Pro Pro Gly Ala Phe Ala Gly Ala Gly 
115 120 125 

Ala Leu Gin Arg Leu Asp Leu Arg Glu Asn Gly Leu His Ser Val His 
130 135 140 

Val Arg Ala Phe Trp Gly Leu Gly Ala Leu Gin Leu Leu Asp Leu Ser 
145 150 155 160 

Ala Asn Gin Leu Glu Ala Leu Ala Pro Gly Thr Phe Ala Pro Leu Arg 
- - 165. . . . _ 170 175 

Ala Leu Arg Asn Leu Ser Leu Ala Gly Asn Arg Leu Ala Arg Leu Glu 
180 185 190 

Pro Ala Ala Leu Gly Ala Leu Pro Leu Leu Arg Ser Leu Ser Leu Gin 
195 200 205 

Asp Asn Glu Leu Ala Ala Leu Ala Pro Gly Leu Leu Gly Arg Leu Pro 
210 21-5 — 220 



Ala Leu Asp Ala Leu His Leu Arg Gly Asn Pro Trp Gly Cys Gly Cys 
225 230 235 240 



Ala Leu Arg Pro Leu Cys Ala Trp Leu Arg Arg His Pro Leu Pro Ala 
245 250 255 
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Ser Glu Ala Glu Thr 
260 

Ser Pro Leu Thr Ala 
275 

Pro Leu Ala Leu Arg 
290 

Leu Leu Pro Arg Gin 
305 

His Arg Leu Pro Cys 
325 

Pro Ala Glu Thr Val 
340 

Val Pro Arg Pro Arg 
355 
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Val Leu Cys Val Trp Pro 
265 

Phe Ser Asp Ala Ala Phe 
280 

Asp Leu Ala Arg Gly Leu 
295 

Pro Gly Phe Leu Pro Gly 
310 315 

Ala Pro Pro Pro Pro Pro 
330 

Gin Thr Arg Thr ' Pro lie 
345 

Thr Arg Gly Ala Pro Ser 
360 



Gly Arg Leu Thr Leu 
270 

Ser His Cys Ala Gin 
285 

His Ala Arg Ala Gly 
300 

Ala Gly Leu Trp Ala 
320 

His Arg Arg Pro Pro 
335 

Pro Thr Pro Thr Ala 
350 

Ala Ala Ala Gin Ala 
365 



<210> 297 - 

<211> 47 

<212> PRT 

<213> Homo sapiens 

<400> 297 

Gly Cys Arg Pro Met Arg Gly Cys 
1 5 

Ser Leu Val Pro Val Leu Pro Ala 
20 

Ala Ala Arg Pro Gly Pro Arg Arg 

- 35 . 40 



Ala Ala Arg Ala Ala Ala Arg Lys 
10 15 

Gly Trp Arg Ser Gly Pro Ala Ala 
25 30 

Leu Ala His Ala Pro Ser Ala 
45 



<210> 298 

<211> 30 

<212> PRT 

<213> Homo sapiens 

<400> 298 

Pro-Gly-Ala -Val— A-l-a-Arg -Pro -Leu-Pro- Arg- Arg His_ Leu^Ala Ala^Ala. 
15 10 15 

His Gly Arg Gly Cys Gly Pro Ala Ala Ala Arg Ala Gly Ala 
20 25 30 



<210> 299 



v 
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<211> 24 

<212> PRT 

<213> Homo sapiens 

<400> 299 

Ser Gly Gin Ser Gly Ala Pro Arg Asp Pro Glu Gly Glu Ala Leu Ala 
15 10 15 

Asp Thr Trp Ala Gin Thr Gly Gin 
20 



<210> 300 
<211> 23 
<212> PRT 

<213> Homo sapiens 
<400> 300 

Pro Pro Ala Pro Ala Val Gly Gly Pro Leu Pro Val Arg Pro Trp Ser 
1 5 10 15 

Pro Ser Ser Ala Glu Pro Val 
20 



<210> 301 

<211> 26 

<212> PRT 

<213> Homo sapiens 

<400> 301 

Ala Pro Arg Thr Thr Gly Ser Arg Asp Ala Gin Ala Ala Gly Leu Pro 
1 5 10 15 

Pro Arg Val Pro Gly Ala Gly Gly Leu Pro 
20 25 



<210> 302 
<211> 22 
<212> PRT 

<213> Homo sapiens 



<400> 302 

Gly Pro Arg Pro Arg Gly Pro Trp Ala Pro Arg Thr Ala Pro Arg Cys 
1 5 10 .15 

Ala_ Arg_Ala_ Cys_Arg_Glu_J * _ 

20 

<210> 303 
<211> 31 
<212> PRT 

<213> Homo sapiens 
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<400> 303 

Ala Val Pro Pro Gly Leu Ser Leu Arg Leu Arg Ala Leu Leu Leu Asp 
1 5 10 15 

His Asn Arg Val Arg Ala Leu Pro Pro Gly Ala Phe Ala Gly Ala 
20 25*. 30 



<210> 304 

<211> 24 

<212> PRT 

<213> Homo sapiens 

<400> 304 

Leu Gly Ala Leu Gin Leu Leu Asp Leu Ser Ala Asn Gin Leu Glu Ala 
1 5 10 15 

Leu Ala Pro Gly Thr Phe Ala Pro 
20 



<210> 305 
<211> 36 

<212> PRT . " _ 

<213> Homo sapiens 

<400> 305 

Pro Pro Gly Ala Phe Ala Gly Ala Gly Ala Leu Gin Arg Leu Asp Leu 
1 5 10 * . 15 

Arg Glu Asn Gly Leu His Ser Val His Val Arg Ala Phe Trp Gly Leu 
20 25 30 

Gly Ala Leu Gin 1 
35 



<210> 306 _ 

<211>~28 

<212> PRT 

<213> Homo sapiens 



<400> 306 

Arg Asn Leu Ser Leu Ala Gly Asn Arg Leu Ala Arg Leu Glu Pro Ala 
1 5 10 15 

_Ala. -Leu_Gly_-Ala_Leu_Pro_Leu_Leu Arg_Ser_Leu_Ser 

20 25 



<210> 307 

<211> 26 

<212> PRT 

<213> Homo sapiens 
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<400> 307 

Leu Pro Ala Leu Asp Ala Leu His Leu Arg Gly Asn Pro Trp Gly Cys 
15 10 15 

Gly Cys Ala Leu Arg Pro Leu Cys Ala Trp 
20 25 



<210> 308 
<211> 34 
<212> PRT 

<213> Homo sapiens 
<400> 308 

Thr Val Leu Cys Val Trp Pro Gly Arg Leu Thr Leu Ser Pro Leu Thr 
1 5 10 15 

Ala Phe Ser Asp Ala Ala Phe Ser His Cys Ala Gin Pro Leu Ala Leu 
20 25 .30 

Arg Asp 



<210> 309 
<211> 24 
<212> PRT 

<213> Homo sapiens 
<400> 309 

Leu His Ala Arg Ala Gly Leu Leu Pro Arg Gin Pro Gly Phe Leu Pro 
1 5 10 15 

Gly Ala Gly Leu Trp Ala His Arg 
20 



<210> 310 

<211> 24 
<212> PRT 

<213> Homo sapiens 
<400> 310 

Thr Val Gin Thr Arg Thr Pro He Pro Thr Pro Thr Ala Val Pro Arg 
15 10 15 

Pro _.Arg_ Thr_Arg_Gly_Ala_Pro_Ser^ _ _ 

20 



<210> 311 
<211> 59 
<212> PRT 

<213> Homo sapiens 
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<400> 311 
His Ala Ser Gly 
1 

Gly Leu Pro Cys 
20 

Cys Arg Leu Cys 
35 

Leu Cys Ser Asp 
50 



Arg Pro Asp Arg 
5 

Pro Asp Leu Glu 

Ala Pro Thr Glu 
40 

Arg Cys Asp Thr 

.55 



Ser Ser Ala Pro 
10 

Pro .Leu Gly Gly 
25 

Ala Arg Gly Leu 



Trp Arg Ser 



lie Gly Asn Ser 
15 

Leu Gin Ser Lys 
30 

Trp Ser Arg Ser 
45 



<210> 312 
<211> 29 
<212> PRT 

<213> Homo sapiens 
<400> 312 

Gly Leu Pro Cys Pro Asp Leu Glu Pro Leu Gly Gly Leu Gin Ser Lys 
1 " 5 10 15 

Cys Arg Leu Cys Ala _£ro Thr Glu Ala _Arg Gly Leu Trp .... 

20 : ~ 25 



<210> 313 
<211> 16 
<212> PRT , 
<213> Homo sapiens 

<400> 313 

Gin Glu Trp Glu Ser Glu Leu Gly Glu Arg Arg Lys Pro Leu Gin Ala 
1 5 10 15 



<210> 314 
<211> 46 
<212> PRT 

< 2 1 3 > Homo s ap iens 
<400> 314 

Cys-Gln-Ser— Ser— Asn-Leu— I-l-e— Phe— £he-Gl-n— Phe-Val— Asn— -I-l-e— Leu— Phe 
15 10 15 

Asn Leu Met Met Asp lie Leu Val Asp Phe Ser lie Thr Lys Met Pro 
20 25 30 



He Asn Ser lie Phe Ser Leu Tyr Phe Cys Tyr Glu He He 
35 40 45 
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<210> 315 

<211> 134 

<212> PRT 

<213> Homo sapiens 

<400> 315 

Gly Pro Val Trp Leu Phe Cys Phe Leu Thr Leu Cys Arg Lys Pro Ser 
15 10 15 

Gin Leu Phe Ser Gin Glu Asn Ser Cys Met Asp Val Ala Gly Gly Val 
20 25 30 

Thr Thr Cys Leu Pro Pro Trp Phe Ser Arg Gly Ala Pro Ala Gin Met 
35 40 45 

Ser Gin Trp Pro Pro Ser Ser Asp His Gly Ala Val Arg Ala Gly Arg 
50 55 60 

Asp Ser Arg Val Gly Pro Val Gin Pro Ser His Leu Thr Cys Glu Gly 
65 70 75 80 

Gly Lys Glu Glu Arg Glu Lys Asn Lys Lys Ala Glu Val Asn. Pro Pro 
85_ . - _ 90 " . 95 

Thr Gly Met Gly Leu Ala Asn Arg lie Pro Arg Asp Asp lie Thr Leu 
100 105 110 

Lys Leu Arg Asn Gin Gly Lys Leu Arg Thr Lys Glu Asn Arg Thr Gin 
115 120 125 

Ser Ala Lys Arg His Pro 
130 



<210> 316 
<211> 42 
<212> PRT 

" < 213>~ "Homo "sapiens" 



<400> 316 

Val Ala Cys Lys Pro Glu Asn Arg Thr Lys Thr His Phe Ala Ser Ser 
15 10 15 

Pro Ala Cys Asp Gly His Ala Leu Gly Gly Gin Val Gly Phe Ala lie 
20 25 30 

Cys Phe Leu Ser Cys Leu Phe Pro Pro Met 
35 40 



<210> 317 
<211> 40 
<212> PRT 
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<213> Homo sapiens 



<400> 317 

Ser His Pro -Met Pro Asn Thr Pro Gin Lys Gin Leu Leu Phe Ser Glu 
15 10 15 

Asp Asn Glu Leu Leu Val Ser Leu Arg Thr Gly Arg Lys Pro Thr Leu 
20 25 30 



Gin Ala Ala Leu Arg Val Thr Gly 
35 40 



<210> 318 

<211> 59 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (26) 

<22 3> Xaa equals any of the naturally occurring L-amino acids 
<400> 318 

Glu Gly Asp Pro Arg.j31y Arg Pro Arg. Pro Arg Pro Leu Gly . Pro Pro 
1 5 : ' lb 15 

Pro Gin Leu Thr Leu Pro Thr Ala Leu Xaa Asp lie Leu Arg Gin Val 
20 25 30 

Arg Ala Pro Gly Leu Arg Leu Ser Arg Ala Leu Glu Val Gly Arg Lys 
35 40 45 

Gly Ser Pro lie Phe Lys lie Gin lie Tyr Leu 
50 55 



<210> 319 

_<211> 250 ' 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (145) 

<223> Xaa equals any of the naturally occurring L-amino acids 

<400>-319 — - 

Ala His Arg Leu Gin lie Arg Leu Leu Thr Trp Asp Val Lys Asp Thr 
15 10 15 

Leu Leu Arg Leu Arg His Pro Leu Gly Glu Ala Tyr Ala Thr Lys Ala 
20 25 30 



Arg Ala His Gly Leu Glu Val Glu Pro Ser Ala Leu Glu Gin Gly Phe 
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35 40 45 

Arg Gin Ala Tyr Arg Ala Gin Ser His Ser Phe Pro Asn Tyr Gly Leu 
50 55 60 

Ser His Gly Leu Thr Ser Arg Gin Trp Trp Leu Asp Val Val Leu Gin 
65 70 75 80 

Thr Phe His Leu Ala Gly Val Gin Asp Ala Gin Ala Val Ala Pro lie 
85 90 95 

Ala Glu Gin Leu Tyr Lys Asp Phe Ser His Pro Cys Thr Trp Gin Val 
100 105 110 

Leu Asp Gly Ala Glu Asp Thr Leu Arg Glu Cys Arg Thr Arg Gly Leu 
115 120 125 

Arg Leu Ala Val lie Ser Asn Phe Asp Arg Arg Leu Glu Gly lie Leu 
130 135 140 

Xaa Gly Leu Gly Leu Arg Glu His Phe Asp Phe Val Leu Thr Ser Glu 
145 150 155 160 

Ala Ala Gly Trp Pro Lys Pro Asp Pro Arg lie Phe Gin Glu Ala Leu 

165-~ ... _■- - 170 175 

Arg Leu Ala His Met Glu Pro Val Val Ala Ala His Val Gly Asp Asn 
180 185 190 

Tyr Leu Cys Asp Tyr Gin Gly Pro Arg Ala Val Gly Met His Ser Phe 
195 200 205 

Leu Val Val Gly Pro Gin Ala Leu Asp Pro Val Val Arg Asp Ser Val 
210 215 220 

Pro Lys Glu His lie Leu Pro Ser Leu Ala His Leu Leu Pro Ala Leu 
225 230 235 240 

_Asp._Cys_.Leu .Gl.u__G.ly _Ser .Thr Pro. Gly Leu ...... 

245 250 



<210> 320 
<211> 27 
<212> PRT 

<213> Homo sapiens 

<400>-320 r-- - 

lie Arg Leu Leu Thr Trp Asp Val Lys Asp Thr Leu Leu Arg Leu Arg 
15 10 15 

His Pro Leu Gly Glu Ala Tyr Ala Thr Lys Ala 
20 25 
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<210> 321 
<211> 24 
<212> PRT 

<213> Homo sapiens 
<400> 321 

Leu Glu Gin Gly Phe Arg Gin Ala Tyr Arg Ala Gin Ser His Ser Phe 
15 10 15 

Pro Asn Tyr Gly Leu Ser His Gly 
20 



<210> 322 
<211> 26 
<212> PRT 

<213> Homo sapiens 
<400> 322 

His Leu Ala Gly Val Gin Asp Ala Gin Ala Val Ala Pro lie Ala Glu 
1 5 10 15 

Gin Leu Tyr Lys Asp Phe Ser His Pro Cys 
20 25 



<210> 323 
<211> 23 
<212> PRT 

<213> Homo sapiens 
<400> 323 

Val Leu Asp Gly Ala Glu Asp Thr Leu Arg Glu Cys Arg Thr Arg Gly 
15 10 15 

Leu Arg Leu Ala Val lie Ser 
20 



<210>" 324 . ~ 
<211> 26 
<212> PRT 

<213> Homo sapiens 
<400> 324 

Arg Glu His Phe Asp Phe Val Leu Thr Ser Glu Ala Ala Gly Trp Pro 
15 10 15 



Lys Pro Asp Pro Arg lie Phe Gin Glu Ala 
20 25 



<210> 325 
<211> 28 
<212> PRT 
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<213> Homo sapiens 
<400> 325 

Glu Pro Val Val Ala Ala His Val Gly Asp Asn Tyr Leu Cys Asp Tyr 
1 5 10 15 

Gin Gly Pro Arg Ala Val Gly Met His Ser Phe Leu 
20 25 



<210> 326 

<211> 23 

<212> PRT 

<213> Homo sapiens 

<400> 326 

Val Val Arg Asp Ser Val Pro Lys Glu His lie Leu Pro Ser Leu Ala 
1 5 10 15 

His Leu Leu Pro Ala Leu Asp 
20 



<210> 327 

<"211> 22 . . 

<212> PRT • 

<213> Homo sapiens 

<400> 327 

lie Arg Lys Leu Gly Pro Gly Leu Ala Pro Cys Ser Cys Arg Ser Gly 
1 5 10 15 

Gin Val Phe Pro Arg Val 
20 



<210> 328 
<211> 241 
<212> PRT 
<213> Homo sapiens 



<400> 328 

Lys Pro Leu Arg Met Ala Arg Pro Gly Gly Pro Glu His Asn Glu Tyr 
15 10 15 

Ala Leu Val Ser Ala Trp His Ser Ser Gly Ser Tyr Leu Asp Ser Glu 
20 25 30 



Gly Leu Arg His Gin Asp Asp Phe Asp Val Ser Leu Leu Val Cys His 
35 40 45 

Cys Ala Ala Pro Phe Glu Glu Gin Gly Glu Ala Glu Arg His Val Leu 
50 55 60 

Arg Leu Gin Phe Phe Val Val Leu Thr Ser Gin Arg Glu Leu Phe Pro 
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65 70 75 80 

Arg Leu Thr Ala Asp Met Arg Arg Phe Arg Lys Pro Pro Arg Leu Pro 
85 90 95 

Pro Glu Pro Glu Ala Pro Gly Ser Ser Ala Gly Ser Pro Gly Glu Ala 
100 105 110 

Ser Gly Leu lie Leu Ala Pro Gly Pro Ala Pro Leu Phe Pro Pro Leu 
115 120 125 

Ala Ala Glu Val Gly Met Ala Arg Ala Arg Leu Ala Gin Leu Val Arg 
130 135 140 

Leu Ala Gly Gly His Cys Arg Arg Asp Thr Leu Trp Lys Arg Leu Phe 
145 150 155 160 

Leu Leu Glu Pro Pro Gly Pro Asp Arg Leu Arg Leu Gly Gly Arg Leu 
165 170 175 

Ala Leu Ala Glu Leu Glu Glu Leu Leu Glu Ala Val His Ala Lys Ser 
180 185 190 

lie Gly Asp lie Asp Pro Gin Leu Asp Cys Phe Leu Ser Met Thr Val 

195 200.. - "~ 20.5 

Ser Trp Tyr Gin Ser Leu lie Lys Val Leu Leu Ser Arg Phe Pro Arg 
210 215 220 

Ala Val Ala lie Ser Lys Ala Gin Thr Trp Glu Leu Ser Thr Trp Leu 
225 230 235 240 

Arg 



<210> 329 
<211> 30 
<212> PRT 

~<2r3> Homo 'sapiens 
<400> 329 

Ala Arg Gly Thr Leu Glu Leu Pro Thr Pro Leu lie Ala Ala His Gin 
15 10 15 

Leu Tyr Asn Tyr Val Ala Asp His Ala Ser Ser Tyr His Met 
20 25 30 



<210> 330 
<211> 37 
<212> PRT 

<213> Homo sapiens 
<400> 330 
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Ser His Cys Glu Trp • Pro Gly Gin Gly Ala Gin Asn Thr Thr Ser Met 
1 5 10 15 

Pro Trp Cys Arg His Gly Thr Val Leu Ala Pro Thr Trp Thr Leu Arg 
20 25 30 

Asp Phe Asp Thr Arg 
35 

<210> 331 

<211> 91 

<212> PRT 

<213> Homo sapiens 

<400> 331 

Pro Leu Thr Thr Val Ser His Leu Cys Pro Leu Ser Leu Arg Val Phe 
15 10 15 

Thr Ser His Leu Asp lie Thr Ala Gly His Ser His Arg Asp Asp Thr 
20 25 30 



Trp Val Pro lie Pro Ala Leu Pro Leu Lys His Leu Arg Pro Pro Ser 
35 40 45 

Ser Pro Phe Ala Leu~Gly Pro -Trp'-Vai Ser His Pro Leu Met Arg Trp 
50 55 60 

Val Gin Lys Leu Ser His Leu His Ser Asn Pro Gly Thr Gly Phe Ser 
65 70 75 80 

Met Gly Gly Lys Ser Ala Glu Lys Leu Lys Cys 
85 90 



<210> 332 
<211> 179 
<212> PRT 
<213> Homo sapiens 

<400> 332 

Ser Thr Ala Ala Arg Gly Ala Pro Gly Pro Gly Arg Ala Gly Gly Thr 
1 5 10 15 

Pro Arg Ser Ser Pro Cys Gin lie His Trp Gly His Arg Pro Pro Ala 
20 25 30 

Gly Leu Leu Pro lie His Asp Gly -Leu Leu Val Pro Glu Pro Asp Gin 

35 40 45 

Ser Ser Pro Lys Pro Leu Pro Gin Ser Cys Arg His Phe Gin Ser Pro 
50 55 60 

Asp Leu Gly Thr Gin Tyr Leu Val Ala Leu Asn Gin Lys Phe Thr Asp 
65 70 75 80 
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Cys Ser Ala Leu 



Val Val Phe Arg 
100 

Pro Pro Ala Gin 
115 

Asn Thr Ala Cys 
130 

Asp Trp Thr Thr 
145 

Leu Pro Ala Arg 



Ala Arg His 



Val Phe Trp Thr 
85 

Glu Ala Leu Pro 



Leu Val Ser Thr 
120 

Phe Thr Leu Leu 
135 

Glu Cys His Cys 
150 

Gly Arg Thr Asp 
165 



Pro Leu Arg Lys 
90 

Val Gin Pro Gin 
105 

Tyr His His Leu 



Asp Pro Pro Pro 
140 

Ser Leu Asn His 
155 

Gin Pro Phe Trp 
170 



Asp Val Ser Glu 
95 

Asp Thr Arg Ser 
110 

Glu Ser Val lie 
125 

Leu Lys Gly Val 



Gly Pro Thr Arg 
160 

Ala Pro Gly Gin 
175 



<210> 333 

<211>56 _ 

<212> PRT - - ■ " 

<213> Homo sapiens 

<400> 333 

His Gin Arg Leu Cys Asn Tyr Val Leu Arg Val Cys Cys Pro Ser Leu 
1 5 10 15 

Ala Ala Gly Thr Ala Leu Pro Lys His Pro Gin Pro Leu Thr His Pro 
20 25 30 

Gly Leu Gin Arg Val Arg Ser Thr Pro Arg Thr Pro Trp Ala Leu Leu 
35 40 45 

Gly Tyr Ser Phe Arg Pro Pro Trp 
50 55 



<210> 334 
<211> 28 
<212> PRT 

<213> Homo sapiens 

<400 > 334 ; 

Pro Gly Gly Pro Glu His Asn Glu Tyr Ala Leu Val Ser Ala Trp His 
1 5 10 15 

Ser Ser Gly Ser Tyr Leu Asp Ser Glu Gly Leu Arg 
20 25 
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<210> 335 
<211> 25 
<212> PRT 

<213> Homo sapiens 
<400> 335 

Asp Val Ser Leu Leu Val Cys His Cys Ala Ala Pro Phe Glu Glu Gin 
15 10 15 

Gly Glu Ala Glu Arg His Val Leu Arg 
20 25 



<210> 336 

<211> 28 

<212> PRT 

<213> Homo sapiens 

<400> 336 

Arg Leu Thr Ala Asp Met Arg Arg Phe Arg Lys Pro Pro Arg Leu Pro 
1 " 5 10 15 

Pro Glu Pro Glu Ala Pro Gly Ser Ser Ala Gly Ser 
20 25 

<210> 337 
<211> 25 
<212> PRT 

<213> Homo sapiens 
<400> 337 

Gly Glu Ala Ser Gly Leu lie Leu Ala Pro Gly Pro Ala Pro Leu Phe 
15 10 15 

Pro Pro Leu Ala Ala Glu Val Gly Met 
20 25 



" <210> 338 
<211> 23 
<212> PRT 

<213> Homo sapiens 
<400> 338 

Thr Leu Trp Lys Arg Leu Phe Leu Leu Glu Pro Pro Gly Pro Asp Arg 
15 10 15 



Leu Arg Leu Gly Gly Arg Leu 
20 



<210> 339 
<211> 28 
<212> PRT 
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<213> Homo sapiens 
<400> 339 

Leu Ala Glu Leu Glu Glu Leu Leu Glu Ala Val His Ala Lys Ser lie 
15 10 15 

Gly Asp lie Asp Pro Gin Leu Asp Cys Phe Leu Ser 
20 25 



<210> 340 
<211> 197 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (97) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 340 

Phe Gin Leu Tyr Phe Asn Pro Glu Leu lie Phe Lys His Phe Gin lie 
15 10 15 

Trp Arg Leu lie Thr^sn Phe Leu jPhe. Phe Gly Pro Val- Gly .Phe Asn 
20 -25 30 

Phe Leu Phe Asn Met lie Phe Leu Tyr Arg Tyr Cys Arg Met Leu Glu 
35 40 45 

Glu Gly Ser Phe Arg Gly Arg Thr Ala Asp Phe Val Phe Met Phe Leu 
50 55 60 

Phe Gly Gly Phe Leu Met Thr Leu Phe Gly Leu Phe Val Ser Leu Val 
65 70 75 80 

Phe Leu Gly Gin Ala Phe Thr lie Met Leu Val Tyr Val Trp Ser Arg 
85 90 95 

Xaa Asn Pro" Tyr Val" Arg Met _ Asn Phe Phe Gly Leu Leu Asn Phe Gin 
100 105 110 

Ala Pro Phe Leu Pro Trp Val Leu Met Gly Phe Ser Leu Leu Leu Gly 
115 120 125 

Asn Ser lie He Val Asp Leu Leu Gly He Ala Val Gly His He Tyr 
130 135 140 



Phe Phe Leu Glu Asp Val Phe Pro Asn Gin Pro Gly Gly He Arg He 

145 150 155 160 

Leu Lys Thr Pro Ser He Leu Lys Ala He Phe Asp Thr Pro Asp Glu 

165 170 175 



Asp Pro Asn Tyr Asn Pro Leu Pro Glu Glu Arg Pro Gly Gly Phe Ala 



WO 99/47540 



PCT/US99/05804 



171 



180 



Trp Gly Glu Gly Gin 
195 



185 



190 



<210> 341 
<211> 108 
<212> PRT 
<213> Homo, sapiens 



<400> 341 
Gly Val Gly Gin 
1 

Leu Glu Tyr Leu 
20 

Cys Val Leu Thr 
35 

Gin Leu Tyr Phe 
50 

Arg Leu lie Thr 
65 

Leu Phe Asn Met 



Gly Ser Phe Arg 
100 



Ala Thr Val Gly 
5 

Gin lie Pro Pro 



Thr Ala Ala Val 
40 

Asn Pro Glu Leu 
55 

Asn_Phe Leu Phe 
70 

lie Phe Leu Tyr 
85 

Gly Arg Thr Ala 



Lys Met Ala Tyr 
10 

Val Ser Arg Ala 
25 

Gin Leu Glu Leu 



lie Phe Lys His 
60 

; Phe .Gly Pro Val 
75 

Arg Tyr Cys Arg 
90 

Asp Phe Val Phe 
105 



Gin Ser Leu Arg 
15 

Tyr Thr Thr Ala 
30 

lie Thr Pro Phe 
45 

Phe Gin lie Trp 



Gly Phe . Asn Phe 
80 

Met Leu Glu Glu 
95 



<210> 342 

<211> 23 

<212> PRT 

<213> Homo sapiens 



<400> 342 

Leu lie Phe Lys His Phe Gin lie Trp Arg Leu lie Thr Asn Phe Leu 
15 10 15 

Phe Phe Gly Pro Val Gly Phe 
20 



_<210>_343 

<211> 25 

<212> PRT 

<213> Homo sapiens 



<400> 343 

Phe Leu Tyr Arg Tyr -Cys Arg Met Leu Glu Glu Gly Ser Phe Arg Gly 
15 10 15 
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Arg Thr Ala Asp Phe Val Phe Met Phe 
20 25 



<210> 344 

<211> 23 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (19) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 344 

Leu Val Phe Leu Gly Gin Ala Phe Thr lie Met Leu Val Tyr Val Trp 
1 5 10 .15 

Ser Arg Xaa Asn Pro Tyr Val 
20 



<210> 345 

<211> 21 - - 

<212> PRT . ".' . • - " * ■ " 

<213> Homo sapiens 

<400> 345 

Val Leu Met Gly Phe Ser Leu Leu Leu. Gly Asn Ser lie lie Val Asp 
1 5 10 15 

Leu Leu Gly lie Ala 
20 



<210> 346 

<211> 25 

<212> PRT 

<213> Homo sapiens 



<400> 346 

Asn Gin Pro Gly Gly lie Arg lie Leu Lys Thr Pro Ser lie Leu Lys 
15 10 15 

Ala lie Phe Asp Thr Pro Asp Glu Asp 
20 25 



<210> 347 
<211> 28 
<212> PRT 

<213> Homo sapiens 



<400> 347 
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Arg Leu Glu Tyr Leu Gin lie Pro Pro Val Ser Arg Ala Tyr Thr Thr 
15 10 15 

Ala Cys Val Leu Thr Thr Ala Ala Val Gin Leu Glu 
20 25 



<210> 348 
<211> 31 
<212> PRT 

<213> Homo sapiens 
<400> 348 

Arg Leu lie Thr Asn Phe Leu Phe Phe Gly Pro Val Gly Phe Asn Phe 
1 5 10 15 

Leu Phe Asn Met lie Phe Leu Tyr Arg Tyr Cys Arg Met Leu Glu 
20 25 30 



<210> 349 
<211> 12 
<212> PRT 

<213> Homo sapiens 

<400> 349 ■ -- . . " - _•- 

His Ala Ser Ala Gly Pro Asp Gly Ser Ser Pro Ala 
15 10 



<210> 350 
<211> 115 
<212> PRT 
<213> Homo sapiens 

<400> 350 

Glu Leu Leu Leu Glu Lys Pro Lys Pro Trp Gin Pro Pro Ala Ala Ala 
1 5 10 15 

Pro His Arg. .Ala Leu -Leu Val— Leu Cys -Tyr Ser~Ile Val~ Glu "Asn" Thr 
20 25 30 

Cys He He Thr Pro Thr Ala Lys Ala Trp Lys Tyr Met Glu Glu Glu 
35 40 45 

He Leu Gly Phe Gly Lys Ser Val Cys Asp Ser Leu Gly Arg Arg His 
50 55 60 

Met -Ser— Thr-GVs--Ala~Leu-Cys"Asp~Phe'Xy¥ — S¥f^ Leu Lys T,eu~Glu Gin 

65 70 75 80 

Cys His Ser Glu Ala Ser Leu Gin Arg Gin Gin Cys Asp Thr Ser His 
85 90 95 

Lys Thr Pro Phe Ala Ala Pro Cys Leu Pro Pro Arg Ala Cys Pro Ser 
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100 105 110 



Ala Thr Arg 
115 



<210> 351 

<211> 77 

<212> PRT 

<213> Homo sapiens 

<400> 351 

Leu Pro Gly Trp Gly Phe Pro Thr Lys lie Cys Asp Thr Asp Tyr lie 
15 10 15 

Gin Tyr Pro Asn Tyr Cys Ser Phe Lys Ser Gin Gin Cys Leu Met Arg 
20 25 30 

Asn Arg Asn Arg Lys Val Ser Arg Met Arg Cys Leu Gin Asn Glu Thr 
35 40 45 

Tyr Ser Ala Leu Ser Pro Gly Lys Ser Glu Asp Val Val Leu Arg Trp 
50 55 60 

Ser Gin Glu Phe SerJThr Leu Thr Leu Gly Gin Phe" Gly 
65 '~ 10 - 75 " 



<210> 352 

<211> 65 

<212> PRT' 

<213> Homo sapiens 

<400> 352 

Ser Pro Val Leu Leu Pro Ala Phe Pro Pro Leu Pro Val Pro Leu Leu 
1 5 10 15 - 

Ala Leu Pro Val Ser Ala Pro Leu Pro Ala Cys Val Leu Vai Ser Ala 
20 25 30 

Pro Ala Cys Ala Pro Leu Leu Ala Pro Ala Cys Ala Leu Ala Leu Ala 
35 40 45 

Pro Gly Phe Pro Gly Thr Arg Arg lie Val Gly Ala Leu Pro Arg Cys 
50 55 60 

Cys 
65 



<210> 353 

<211> 35 

<212> PRT 

<213> Homo sapiens 
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<400> 353 

Leu Leu Val Leu Cys Tyr Ser lie Val Glu Asn Thr Cys He He Thr 
1.5 10 15 

Pro Thr Ala Lys Ala Trp Lys Tyr Met Glu Glu Glu He Leu Gly Phe 
20 25 30 

Gly Lys Ser 
35 



<210> 354 
<211> 26 
<212> PRT 

<213> Homo sapiens 
<400> 354 

Leu Lys Leu Glu Gin Cys His Ser 
1 5 

Cys Asp Thr Ser His Lys Thr Pro 
20 



Glu Ala Ser Leu Gin Arg Gin Gin. 
10 15 

Phe Ala 
25 



<210> 355 "- 
<211> 40 . : 

<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (27) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 355 

Gin Val Ser Gly Leu He Leu Ser Leu Ser Cys Gly Met Asp Gly Leu 
1 5 10 15 

Ala Leu Asp Gly Ser Pro Ser Pro Ser Pro Xaa Thr Glu Lys Ala Gly 

. — - - 20 •- - • - -25 " " - " 3 0 

Arg Cys He Ser Gin Thr Ser Leu 
35 40 



<210> 356 
<211> 46 

<212> PRT • 

* <213> — Homo" sapiens 

<220> 

<221> SITE 
<222> (27) 

<223> Xaa equals any of the naturally occurring L-amino acids 
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<400> 356 
Gin Val Ser Gly 
1 

Ala Leu Asp Gly 
20 

Arg Cys lie Ser 
35 



Leu lie Leu Ser 
5 

Ser Pro Ser Pro 

Gin Thr Ser Leu 
40 



Leu Ser Cys Gly 
10 

Ser Pro Xaa Thr 
25 

Pro Gly Lys Trp 



Met Asp Gly Leu 
15 

Glu Lys Ala Gly 
30 

Glu Val 
45 



<210> 357 

<211> 173 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (118) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 357 

Arg Ala Ser Lys Thr Val Pro Arg Met Pro Pro Asn Trp Pro Ala Lys 
1 5 10 15 

Met Pro Cys Leu Cys His -lie Arg Thr Val Glu His Leu Gly Thr lie 
20 25 30 

Ser Ser Gly Ala Pro Gly Arg Pro Thr Gly Gin Gin Ala Ala Arg Thr 
35 .40 45 

Tyr His lie Cys Trp lie His Pro Gly Gin Lys lie Asp Ser Leu Pro 
50 55 60 

Pro Ser Ser Gin His Pro Arg Ser Gin Gin Leu Ala Pro Gly Thr Trp 
65 70 75 80 



Pro Ser Thr Ser Thr Thr Lys Pro 
85 

Ala Ser Leu Pro lie Ser Gin Ala 
100 

Gin Pro Ser Pro Trp Xaa Val Arg 
115 120 



Ala Glu Glu Thr Leu Gly Ser Ser 
90 95 

Arg Lys Ser Glu Lys Cys Thr Phe 
105 110 

Gly Lys Glu Ser His Gin Val Pro 
125 



Ala His Pro Ser His Arg Thr Glu Thr Glu Ser Asp His Ser Pro Val 
130 135 * 140 



Arg Lys Pro Pro Ser Arg Gly Thr Arg Thr Gly Asp Phe Thr Val Gly 
145 150 155 160 



Asp Trp Ser Glu Ala Trp Leu Leu Glu Leu Ala Leu Leu 
165 170 
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<210> 358 

<211> 23 

<212> PRT 

<213> Homo sapiens 

<400> 358 

Arg Met Pro Pro Asn Trp Pro Ala Lys Met Pro Cys Leu Cys His lie 
1 5 10 15 

Arg Thr Val Glu His Leu Gly 
20 



<210> 359 

<211> 25 

<212> PRT 

<213> Homo sapiens 

<400> 359 

Gly Arg Pro Thr Gly Gin Gin Ala Ala Arg Thr Tyr His lie Cys Trp 
15 10 15 

lie His Pro Gly Gin Lys lie Asp Ser 

20 . ' ' . -25 



<210> 360 
<211> 25 
<212> PRT 

<213> Homo sapiens 
<400> 360 

Trp Pro Ser Thr Ser Thr Thr Lys Pro Ala Glu Glu Thr Leu Gly Ser 
1-5 10 15 

Ser Ala Ser Leu Pro lie Ser Gin Ala 
20 25 



<210> 361 

<211> 23 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 

<222> (13) . 

<2 2 3 >~ Xaa~equa 1 s~ariy^~6~f ~~ the Naturally occurring L-amino ac ids 

<400> 361 

Lys Ser Glu Lys Cys Thr Phe Gin Pro Ser Pro Trp Xaa Val Arg Gly 
15 10 15 

Lys Glu Ser His Gin Val Pro 
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20 



<210> 362 
<211> 24 
<212> PRT 

<213> Homo sapiens 
<400> 362 

Lys Pro Pro Ser Arg Gly Thr Arg Thr Gly Asp Phe Thr Val Gly Asp 
15 10 15 

Trp Ser Glu Ala Trp Leu Leu Glu 
20 



<210> 363 
<211> 10 
<212> PRT 

< 2 1 3 > Homo s ap i ens 
<400> 363 

Pro Cys Ala Asp Cys Leu Ser Ala Trp Ala 
15 10 



<210> 364 
<211> 11 
<212> PRT 

<213> Homo sapiens 
<400> 364 

His Ala Ser Gly Tyr Leu Cys lie Val Leu Leu 
1 5 10 



<210> 365 
<211> 34 
<212> PRT 

<213> Homo sapiens — 

<400> 365 

Asn Ser Ala Arg Ala Ala Arg Ala Glu lie Val Leu Gly Leu Leu Val 
1 5 10 15 

Trp Thr Leu lie Ala Gly Thr Glu Tyr Phe Arg Val Pro Ala Phe Gly 
20 25 30 



Trp Val 



<210> 366 
<211> 22 
<212> PRT 
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<213> Homo sapiens 
<400> 366 

Pro Cys Ser Pro Pro Asp Ser Pro Pro Leu Pro Gly Ala Phe Val Trp 
15 10 15 

Arg Val Leu Trp Val Cys 
20 



<210> 367 
<211> 25 
<212> PRT 

<213> Homo sapiens 
<400> 367 

Ala Arg Ala Cys Phe Ala Tyr Asn Gly Val Cys Ser Glu Gly Arg Cys 
1 5 10 15 

Trp Asp Ser His Phe His Gly Ser Val 
20 25 



<210> 368 

<211> 100 - - ... . 

<212>" PRT . 
<213> Homo sapiens 

<400> 368 

Met Ser Asn Met Gly Lys lie Pro Ser Leu Ser Leu His lie Pro lie 
1 5 .10 15 

Asn Lys Tyr lie Cys Ser Arg lie Pro Lys Phe He Gin Lys Val Asn 
20 25 30 

Lys Ser Thr Val Leu Gin He Cys Leu Lys Arg Gin He lie Leu Asn 
35 40 45 

Lys Asn Lys Met Ser Asp His Ser Lys He Gly Lys Ala Asn Leu Val 

50- -- - 55 - -• - - - 60 

Gin He Asp He His Ser Leu Gly lie Val Glu Thr Gly Cys Val Pro 
65 70 75 80 

Ser Lys Arg Tyr Cys Thr Leu Leu Thr Glu Gin Ser Gly Phe Pro Phe 
85 90 95 

Leu Ser His Pro . 

roo ' 



<210> 369 
<211> 84 
<212> PRT 

<213> Homo sapiens 
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<220> 

<221> SITE 
<222> (54) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (58) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (82) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 369 

Met Ala Gly Cys Cys Leu Lys Leu Phe Gly Val Leu Ser Leu Cys Phe 
1 .5 10 15 

Leu Cys Gly Leu lie Ser lie Glu Arg Val lie Cys Asn Pro Val Ser 
20 25 30 

Ala Asp Phe Gin Val Ser Thr Phe Cys Gin Arg His Cys Leu Leu Arg 

3 5 ^ 40 . _ " 45 

Ser Lys Val Met Phe Xaa lie Lys Gly Xaa Thr Ala Thr lie Glu Val 
50 55 60 

lie Asn Glu Asn Cys Thr Leu Val Ala Ala Pro Pro lie Gly Phe Pro 
65 70 75 80 

lie Xaa Phe Leu 



<210> 370 
<211> 49 
<212> PRT 

<-213> Homo sapiens ~ 
<400> 370 

Met Ser Asp His Ser Lys lie Gly Lys Ala Asn Leu Val Gin lie Asp 
15 10 15 

lie His Ser Leu Gly lie Val Glu Thr Gly Cys Val Pro Ser Lys Arg 
20 25 30 



Tyr Cys Thr Leu Leu Thr Glu Gin Ser Gly Phe Pro Phe Leu Ser His 
35 40 45 



Pro 
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<210> 371 
<211> 50 
<212> PRT 

<213> Homo sapiens 
<400> 371 

Met Ala Gly Cys Cys Leu Lys Leu Phe Gly Val Leu Ser Leu Cys Phe 
1 5 10 15 

Leu Cys Gly Leu lie Ser lie Glu Arg Val lie Cys Asn Pro Val Ser 
20 25 30 

Ala Asp Phe Gin Val Ser Thr Phe Cys Gin Arg His Cys Leu Leu Arg 
35 40 45 

Ser Lys 
50 



<210> 372 

<211> 34 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE "~ - " 

<222> (4) 

<223> Xaa equals any of the naturally occurring L- amino acids 
<220> 

<221> SITE 
<222> (8) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (32) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 372 

Val Met Phe Xaa lie Lys Gly Xaa Thr Ala Thr He Glu Val He Asn 
15 10 15 

Glu Asn Cys Thr Leu Val Ala Ala Pro Pro He Gly Phe Pro He Xaa 
20 25 30 

Phe Leu 



<210> 373 

<211> 65 

<212> PRT 

<213> Homo sapiens 
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<400> 373 

Pro Thr Glu Gly Arg Gin Lys Val Leu Lys Thr Phe Thr Val Pro Arg 
15 10 15 

Ser Ala Leu Ala Met Thr Lys Thr Ser Thr Cys lie Tyr His Phe Leu 
20 25 30 

Val Leu Ser Trp Tyr Thr Phe Leu Asn Tyr Tyr lie Ser Gin Glu Gly 
35 40 45 

Lys Asp Glu Val Lys Pro Lys lie Leu Ala Asn Gly Ala Arg Trp Lys 
50 55 60 



Tyr 




65 




<210> 


374 


<211> 


35 


<212> 


PRT 


<213> 


Homo ; 


<400> 


374 


Pro Arg Ser 


' 1 ' 





5^ . ; * . - _ 10 15 

Phe Leu Val Leu Ser Trp Tyr Thr Phe Leu Asn Tyr Tyr lie Ser Gin 
20 25 30 

Glu Gly Lys 
35 



<210> 375 
<211> 24 
<212> PRT 

<213> Homo sapiens 

<400> .375 . . 

Pro Thr Glu Gly Arg Gin Lys Val Leu Lys Thr Phe Thr Val Pro Arg 
15 10 15 

Ser Ala Leu Ala Met Thr Lys Thr 
20 



<210> 376 

<2-ll->-2-7 

<212> PRT 

<213> Homo sapiens 



<400> 376 

Phe Leu Asn Tyr Tyr lie Ser Gin Glu Gly Lys Asp Glu Val Lys Pro 
15 10 15 
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Lys lie Leu Ala Asn Gly Ala Arg Trp Lys Tyr 
20 25 



<210> 377 

<211> 13 

<212> PRT 

<213> Homo sapiens 

<400> 377 

Phe Lys Asp Gin Leu Val Tyr Pro Leu Leu Ala Phe Thr 
1 5 10 



<210> 378 
<211> 13 
<212> PRT 

<213> Homo sapiens 
<400> 378 

Arg Gin Ala Leu Asn Leu Pro Asp Val Phe Gly Leu Val 
1 5 10 



<210> 379 
<211> 10 
<212> PRT 

<213> Homo sapiens 
<400> 379 

Ala Thr Ala Ser His Asp Leu Leu Leu Phe 
15 10 



<210> 380 
<211> 97 
<212> PRT 

<213> Homo sapiens 

<220> • • ~ ~" " 

<221> SITE 
<222> (72) 

<223> Xaa equals any of the naturally occurring L-amino acids 



Asn lie Cys Leu Met Gin Ser Lys Thr Gin Gly Ser Cys 
5 10 15 



Leu Leu Pro His Pro Val Pro lie lie Leu Lys Val Ser 
20 25 30 

Thr Val Phe Ser Leu Leu Ser Leu Phe Arg Leu Leu Phe Leu Ser Phe 
35 40 45 



<400> 380 
Met Ser lie 
1 



Gin Tyr Leu 



Cys Pro His Pro Lys Lys Cys Ser Tyr Leu Leu 'Lys Tyr Tyr Gly Pro 



WO 99/47540 



PCT/US99/05804 



184 



50 55 60 

Leu Glu Gly His Lys Thr Leu Xaa Tyr Leu Arg Thr Asn Leu Gly Val 
65 70 75 80 

lie Gin Pro Pro Leu Arg Met Tyr Ala Ala Glu Asp Cys Asn Gly lie 
85 90 95 

Gly 



<210> 381 
<211> 46 
<212> PRT 

<213> Homo sapiens 
<400> 381 

Met Ser lie Asn lie Cys Leu Met Gin Ser Lys Thr Gin Gly Ser Cys 
1 5 10 15 

Gin Tyr Leu Leu Leu Pro His Pro Val Pro lie lie Leu Lys Val Ser 
20 25 30 

Thr Val Phe Ser Leu Leu Ser Leu . Phe Arg Leu Leu Phe Leu 
35 40". " 45 ' 



<210> 382 

<211> 51 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (26) 

<223> Xaa ecjuals any of the naturally occurring L-amino acids 
<4Q0> 382 

Ser Phe Cys Pro His Pro Lys Lys Cys Ser Tyr Leu Leu Lys Tyr Tyr 
1 5 10 15 

Gly Pro Leu Glu Gly His Lys Thr Leu Xaa Tyr Leu Arg Thr Asn Leu 
20 25 30 

Gly Val lie Gin Pro Pro Leu Arg Met Tyr Ala Ala Glu Asp Cys Asn 
35 40 45 

Gly He Gly 
50 



<210> 383 
<211> 23 
<212> PRT 
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<213> Homo sapiens 
<400> 383 

Lys Glu Glu Asp Asp Asp Thr Glu Arg Leu Pro Ser Lys Cys Glu Val 
15 10 15 

Cys Lys Leu Leu Ser Thr Glu 
20 



<210> 384 

<211> 23 

<212> PRT 

<213> Homo sapiens 

<400> 384 

Lys Glu Glu Asp Asp Asp Thr Glu Arg Leu Pro Ser Lys Cys Glu Val 
15 10 15 

Cys Lys Leu Leu Ser Thr Glu 
20 



<210> 385 

<211> 19 ..*",-.. "'" 

<212> PRT 

<213> Homo sapiens 

<400> 385 

Leu Gin Ala Glu Leu Ser Arg Thr Gly Arg Ser Arg Glu Val Leu Glu 
15 10 15 

Leu Gly Gin 



<210> 386 
<211> 19 

<212> PRT _ _ _ 

<213> Homo sapiens 

<400> 386 

Leu Gin Ala Glu Leu Ser Arg Thr Gly Arg Ser Arg Glu Val Leu Glu 
15 10 15 

Leu Gly Gin 



<210> 387 

<211> 12 

<212> PRT 

<213> Homo sapiens 

<400> 387 
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Arg Gin Ala Val lie Val Cys Arg Arg Arg Phe Val 
15 10 



<210> 388 

<211> 148 

<212> PRT 

<213> Homo sapiens 

<400> 388 

Pro Pro Arg Trp Ala His Pro Lys Ala Pro Glu Gly Ser Pro Asp Pro 
1 5 10 15. 

Pro Ser Pro Pro Ser Ala Leu Gly Leu Ser Val Leu Pro Trp Ser Asp 
20 25 30 

Ser Asp Pro Trp His lie Ser Val Ser Pro Cys Ala Gin Arg Glu His 
35 40 45 

Tyr Ser Pro Gly Ser Ala His lie Asn Ser Leu Arg Pro Leu Pro Ala 
50 55 60 

Leu Ser Leu Lys Arg Cys Lys Ala Arg Val Ser Ser Ser Cys Leu Tyr 
65 70 75 80 

Pro Ala Pro Ala Pro Ala Pro- Ala Pro Leu Glu lie Asp Arg Cys Asp 
85 90 95 

Ser Val Pro Pro Val Ala Leu Cys Ser Ala Ala Tyr Thr Leu Arg lie 
100 105 110 

Cys Trp Ala Ser Val Leu Cys His Arg Pro Pro Pro Ser Thr Ser Gin 
115 120 125 

Pro Lys Pro Arg Ala Arg Pro Lys Lys Gly Lys Ala lie Phe Pro Thr 
130 135 140 

Ala Gin Val Pro 
145 



<210> 389 
<211> 71 
<212> PRT 

<213> Homo sapiens 
<400> 389 

-Pro —Pro— Arg— Trp -A-l-a— H-i-s- P r o — Ly s— A-l-a— P-r o -G 1-u-G 1-y -S er— P r o -Asp- Pr o 
15 10 15 

Pro Ser Pro Pro Ser Ala Leu Gly Leu Ser Val Leu Pro Trp Ser Asp 
20 25 30 



Ser Asp Pro Trp His lie Ser Val Ser Pro Cys Ala Gin Arg Glu His 
35 40 45 
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Tyr Ser Pro Gly Ser Ala His lie Asn Ser Leu Arg Pro Leu Pro Ala 
50 55 60 

Leu Ser Leu Lys Arg Cys Lys 
65 70 



<210> 390 
<211> 77 
<212> PRT 

<213> Homo sapiens 
<400> 390 

Ala Arg Val Ser Ser Ser Cys Leu Tyr Pro Ala Pro Ala Pro Ala Pro 
1 5 10 15 

Ala Pro Leu Glu lie Asp Arg Cys Asp Ser Val Pro Pro Val Ala Leu 
20 25 30 

Cys Ser Ala Ala Tyr Thr Leu Arg lie Cys Trp Ala Ser Val Leu Cys 
35 40 45 

His Arg Pro Pro Pro Ser Thr Ser Gin Pro Lys Pro Arg Ala Arg Pro 

.50 _ 55 " 60 

Lys Lys Gly Lys Ala He Phe Pro Thr Ala Gin Val Pro 
65 70 .75 



<210> 391 
<211> 26 
<212> PRT 

<213> Homo sapiens 
<400> 391 

Glu Glu Lys Leu Phe Thr Ser Ala Pro Gly Arg Asp Phe Trp Val Met 
1 5 10 15 



Gly Glu Thr Arg Asp Gly Asn Glu Glu Asn 
20 25 



<210> 392 
<211> 42 
<212> PRT 

<213> Homo sapiens 



<400> 392 

Gin Lys Pro Thr Phe Ala Leu Gly Glu Leu Tyr Pro Pro Leu lie Asn 
15 10 15 



Leu Trp Glu Ala Gly Lys Glu Lys Ser Thr Ser Leu Lys Val Lys Ala 
20 25 30 
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Thr Val He Gly Leu Pro Thr Asn Met Ser 
35 40 
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SEQUENCE DATA BASE MPSRCH: EST, GcnEmbl. NJ3eneseq_34, IssuedJ>atcntB_NA, SPTREMBL_8, 
SwissProt_36, PI R_5 8, Issued Paten ta_AA, (SEQ ID NOS: 11 and 108 only). One nucleotide sequence and one amino 
acid sequence have been searched. It is not clear which sequences are embraced by the claims because the claims refer 
to sequences X and Y. The table at pages 180-188 contains many sequcnoes X and Y, yet the claims refer to X and Y 
in the singular only. Accordingly, the first X nucleotide sequence disclosed and the first Y amino acid sequence 
disclosed wore searched. 

BOX II. OBSERVATIONS WHERE UNITY OF INVENTION WAS LACKING 
This ISA found multiple inventions as follows: 

This application contains the following inventions or groups of inventions which are not so linked as to form a single 
inventive concepmnder PCT Rule 13.1. In order for all inventions to be searched, the appropriate additional search fees 
must be paid. 

Group I, claims 1-21, drawn to nucleic acid molecules, vectors, host cells containing recombinant nucleic acid molecules, 
polypeptides, antibodies, a method, of producing a polypeptide, a method for treating a medical condition comprising 
administering a polypeptide, a method of diagnosing a pathological condition by genetio analysis or protein assay, and a 
method for identifying a binding partner to a polypeptide, for gene 1, the nucleic acid molecule identified by SEQ ID 
NO:ll, and the polypeptide identified by SEQ ID NO: 108, as listed in the table on pages 180-188 of the description. 
Groups II through XCV, claims 1-21 for each group, drawn to nucleic acid molecules, vectors, host cells containing 
recombinant nucleic acid molecules, polypeptides, antibodies, a method of producing a polypeptide, a method for treating 
a medical condition comprising administering a polypeptide, a method of diagnosing a pathological condition by genetic 
analysis or protein assay, and a method for identifying a binding partner to a polypeptide for, genes 2 through 95, the 
nucleic acid molecules identified by SEQ ID NOS: 12 through 105, and the polypeptides identified by SEQ ID NOS: 109 
through 202 respectively, as listed in the table on pages 180-188 of the description. 



The inventions listed as Groups I through XCV do not relate to a single inventive concept under PCT Rule 13.1 because, 
under PCT Rule 13.2, they lack the same or corresponding special technical features for the following reasons: Pursuant 
to 37 C.F.R.§ 1.475(b-dX^e ISA/US considers that where multiple products and processes are claimed, the main 
invention shall consist of the first invention of the category first mentioned in the claims and the first recited invention 
of each of the other categories related thereto. 

Pursuant to 37 C.F.R. $ 1.475(b-d), the ISA/US considers that where multiple products and processes are claimed, the 
main invention shall consist of the first invention of the category first mentioned in the claims and the first recited 
invention of each of the other categories related thereto. Accordingly, the main invention (Group I) comprises the first 
recited product, a polynucleotide comprising gene No. 1, identified by SEQ ID NO: 11, the polypeptide it encodes, 
identified by SEQ ID NO: 108, methods -of producing the polypeptide, and methods of using the polynucleotide and 
polypeptide. Further pursuant to 37 C.F.R. § 1.475(b-d), the ISA/US considers that any feature which the subsequently 

recited producU_and_ methods share with_thc main in v_eDUon_doc3_no^ techinc al feature within the 

meaning of PCT Rule 13.2 and that each of such products and methods accordingly defines a separate invention. 
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